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^OT.P.rTn.RS T1^9-REI.ftTF.D PROTEIN 

P&MTT.Y ANT TTgRS THEREOF 
5 p;.r^kaT-ound t:he Invention 

The invention relates to a novel secreted protein 

and the gene encoding it. 

Many membrane -associated and secreted proteins. 

for example, cytokines, play a vital role in the 
10 regulation of cell growth, cell differentiation, and a 
variety of specific cellular responses. A number of 
medically useful proteins, including erythropoietin, 
granulocyte-macrophage colony stimulating factor, human 
growth hormone, and various interleukins . are secreted 
15 proteins. Thus, an important goal in the design and 
development of new therapies is the identification and 
characterization of membrane-associated and secreted 
proteins and the genes which encode them. 

Many membrane-associated proteins are receptors 
20 which bind a ligand and transduce an intracellular 

signal, leading to a variety of cellular responses. The 
identification and characterization of such a receptor 
enables one to identify both the ligands which bind to 
the receptor and the intracellular molecules and signal 
25 transduction pathways associated with the receptor, 
permitting one to identify or design modulators of 
receptor activity, e.g., receptor agonists or antagonists 
and modulators of signal transduction. 

o„mn,ar-Y r.f the Tnvention 
30 The present invention is based, at least in part, 

on the discovery of a gene encoding T139. The T139 cDNA 
described below (SEQ ID N0:1) has a 1338 nucleotide open 
reading frame (nucleotides 95-1432 of SEQ ID N0:1; SEQ ID 
NO: 3) which encodes a 446 amino acid protein (SEQ ID 
35 N0:2). This protein includes a predicted signal sequence 
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Of about 26 amino acids (from about amino acid 1 to about 
amino acid 26 of SEQ ID N0:2) and a predicted mature 
protein of about 420 amino acids (from about amino acid 
27 to amino acid 446 of SEQ ID N0:2; SEQ ID NO: 4) . T139 
5 protein possesses a sperm-coating protein (SCP) domain 
(amino acids 47 to 190, of SEQ ID N0:2; SEQ ID NO:5). a 
C-type lectin domain (amino acids 297 to 412. of SEQ ID 
N0:2; SEQ ID NO: 6), and two epidermal growth factor 
(EGF)-like domains (amino acids 232 to 260 of SEQ ID 
10 NO: 2; SEQ ID NO: 7, referred to herein as the "EGFl 

domain" and amino acids 264 to 291 of SEQ ID N0:2; SEQ ID 
N0:8, referred to herein as the "EGF2 domain"). 

The T139 molecules of the present invention are 
useful as modulating agents in regulating a variety of 
15 cellular processes. Accordingly, in one aspect, this 
invention provides isolated nucleic acid molecules 
encoding T139 proteins or biologically active portions 
thereof, as well as nucleic acid fragments suitable as 
primers or hybridization probes for the detection of 
20 T139-encoding nucleic acids. 

The invention features a nucleic acid molecule 
which is at least 45% (or 55%, 65%, 75%, 85%, 95%, or 
98%) identical to the nucleotide sequence shown in SEQ ID 
N0:1, or SEQ ID N0:3, or the nucleotide sequence of the 
25 cDNA insert of the plasmid deposited with ATCC as 
Accession Number (the "cDNA of ATCC 98694"), or a 
complement thereof. The invention features a nucleic 
acid molecule which includes a fragment of at least 300 
(325, 350, 375, 400, 425, 450. 500, 550, 600, 650. 700, 
30 800, 900, 1000, or 1290) nucleotides of the nucleotide 
sequence shown in SEQ ID N0:1, or SEQ ID NO: 3, or the 
nucleotide sequence of the cDNA of ATCC 98694, or a 
complement thereof. 

The invention also features a nucleic acid 
35 molecule which includes a nucleotide sequence encoding a 
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protein having an amino acid sequence that is at least 
45% {or 55%, 65%, 75%, 85%, 95%, or 98%) identical to the 
amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, or the 
amino acid sequence encoded by the cDNA of ATCC 98694. 
5 in a preferred embodiment, a T139 nucleic acid molecule 
has the nucleotide sequence shown SEQ ID N0:1, or SEQ ID 
NO: 3, or the nucleotide sequence of the cDNA of ATCC 
98694. 

Also within the invention is a nucleic acid 
10 molecule which encodes a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO:-2 or SEQ ID NO: 4, 
the fragment including at least 15 (25, 30, 50, 100, 150, 
300, or 400) contiguous amino acids of SEQ ID N0:2 or SEQ 
ID NO: 4 or the polypeptide encoded by the cDNA of ATCC 
15 98694. 

The invention includes a nucleic acid molecule 
which encodes a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 
20 the CDNA of ATCC 98694, wherein the nucleic acid molecule 
hybridizes to a nucleic acid molecule comprising SEQ ID 
N0:1 or SEQ ID NO: 3 under stringent conditions. 

Also within the invention are: an isolated T139 
protein having an amino acid sequence that is at least 
25 about 65%, preferably 75%, 85%, 95%, or 98% identical to 
the amino acid sequence of SEQ ID NO: 4 (mature human 
T139) or the amino acid sequence of SEQ ID NO:2 (immature 
human T139) ; and an isolated T139 protein having an amino 
acid sequence that is at least about 85%. 95%, or 98% 
30 identical to the SCP-like domain of SEQ ID N0:2 (e.g., 
about amino acid residues 47 to 190 of SEQ ID N0:2) , C- 
type lectin domain (e.g., about amino acid residues 297 
to 412 of SEQ ID N0:2), and EGF-like domains (e.g., about 
amino acids residues 232 to 260 or 264 to 291 of SEQ ID 
35 N0:2). 
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Also within the invention are: an isolated T139 
protein which is encoded by a nucleic acid molecule 
having a nucleotide sequence that is at least about 65%, 
preferably 75%, 85%, or 95% identical to SEQ ID NO:3 or 
5 the CDNA of ATCC 98694; an isolated T139 protein which is 
encoded by a nucleic acid molecule having a nucleotide 
sequence at least about 65% preferably 75%, 85%, or 95% 
identical to the SCP-like domain encoding portion of SEQ 
ID NO-1 (e.g.. about nucleotides 233 to 665 of SEQ ID 
10 NO-1) , C-type lectin domain encoding portion of SEQ ID 
NO-1 (e.g., about nucleotides 983 to 1330 of SEQ ID 
NO-1) or EGF-like domain encoding portions of SEQ ID 
NO-1 (e.g., about nucleotides 788 to 874 and 884 to 967 
of SEQ ID NO:l); and an isolated T139 protein which xs 
15 encoded by a nucleic acid molecule having a nucleotide 
sequence which hybridizes under stringent hybridization 
conditions to a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NO: 3 or the non-coding 
strand of the cDNA of ATCC 98694. 
20 Also within the invention is a polypeptide which 

is a naturally occurring allelic variant of a polypeptide 
that includes the amino acid sequence of SEQ ID NO: 2 or 
SEQ ID NO: 4 or an amino acid sequence encoded by the cDNA 
of ATCC 98694, wherein the polypeptide is encoded by a 
25 nucleic acid molecule which hybridizes to a nucleic acid 
molecule comprising SEQ ID N0:1 or SEQ ID NO: 3 under 
stringent conditions. 

Another embodiment of the invention features T139 
nucleic acid molecules which specifically detect T139 
30 nucleic acid molecules. For example, in one embodiment, 
a T139 nucleic acid molecule hybridizes under stringent 
conditions to a nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID N0:1, SEQ ID NO: 3, or the 
CDNA of ATCC 98694, or a complement thereof. In another 
35 embodiment, the T13 9 nucleic acid molecule is at least 
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300 (325, 350, 375, 400, 425, 450. 500. 550, 600, 650, 
700, 800, 900. 1000, or 1290) nucleotides in length and 
hybridizes under stringent conditions to a nucleic acid 
molecule comprising the nucleotide sequence shovm in SEQ 
5 ID N0:1, SEQ ID No':3, the cDNA of ATCC 98694, or a 
complement thereof. In a preferred embodiment, an 
isolated T139 nucleic acid molecule comprises nucleotides 
233 to 665 of SEQ ID N0:1, encoding the SCP-like domain 
of T139; nucleotides 983 to 1330 of SEQ ID N0:1, encoding 
10 the C-type lectin domain of T139; or nucleotides 788 to 
874 or 884 to 967 of SEQ ID N0:1, encoding a EGF like 
domain of T139, or a complement thereof. In another 
embodiment, the invention provides an isolated nucleic 
acid molecule which is antisense to the coding strand of 

15 a T139 nucleic acid. 

Another aspect of the invention provides a vector, 
e.g.. a recombinant expression vector, comprising a T139 
nucleic acid molecule of the invention. In another 
embodiment the invention provides a host cell containing 

20 such a vector. The invention also provides a method for 
producing T139 protein by culturing, in a suitable 
medium, a host cell of the invention containing a 
recombinant expression vector such that a T139 protein is 
produced . 

25 Another aspect of this invention features isolated 

or recombinant T139 proteins and polypeptides. Preferred 
T139 proteins and polypeptides possess at least one 
biological activity possessed by naturally occurring 
human T139. e.g.. (D the ability to form protein:protein 

30 interactions with proteins; and (2) the ability to bind a 
T139 ligand. Other activities include: (1) modulation of 
cellular proliferation and (2) modulation of cellular 

differentiation. 

The T139 proteins of the present invention, or 
35 biologically active portions thereof, can be operatively 
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linked to a non-T139 polypeptide (e.g.. heterologous 
amino acid sequences.) to form T139 fusion proteins. The 
invention further features antibodies that specifically 
bind T139 proteins, such as monoclonal or polyclonal 
5 antibodies. In addition, the T139 proteins or 

biologically active portions thereof can be incorporated 
into pharmaceutical compositions, which optionally 
include pharmaceutical ly acceptable carriers. 

In another aspect, the present invention provides 
10 a method for detecting the presence of T139 activity or 
expression in a biological sample by contacting the 
biological sample with an agent capable of detecting an 
indicator of T139 activity such that the presence of T139 
activity is detected in the biological sample. 
15 In another aspect, the invention provides a method 

for modulating T139 activity comprising contacting a cell 
with an agent that modulates (inhibits or stimulates) 
T139 activity or expression such that T139 activity or 
expression in the cell is modulated. In one embodiment, 
20 the agent is an antibody that specifically binds to T139 
protein. In another embodiment, the agent modulates 
expression of T139 by modulating transcription of a T139 
gene, splicing of a T139 mRNA, or translation of a T139 
mRNA. in yet another embodiment, the agent is a nucleic 
25 acid molecule having a nucleotide sequence that is 

antisense to the coding strand of the T139 mRNA or the 
T139 gene. 

In one embodiment, the methods of the present 
invention are used to treat a subject having a disorder 

30 characterized by aberrant T139 expression or activity by 
administering an agent which is a T139 modulator to the 
subject, in one embodiment, the T139 modulator is a T139 
protein. In another embodiment the T139 modulator is a 
T139 nucleic acid molecule. In other embodiments, the 

35 T139 modulator is a peptide, peptidomimetic, or other 
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small molecule. In a preferred embodiment, the disorder 
characterized by aberrant T139 protein or nucleic acid 
expression is a proliferative or dif f erentiative 
disorder, particularly of the immune system. 
5 T139 nucleic acids, polypeptides, and modulators 

of T139 expression or activity can be used to modulate 
spermatogenesis. For example, such molecules can be used 
as a contraceptive to decrease spermatogenesis. 
Alternatively, such molecules can be used to increase 
10 spermatogenesis. Such molecules may also be useful for 
treatment of disorders related-to defects in sperm-egg 
fusion and disorders of acrosome formation or function. 
T139 nucleic acids, polypeptides, and modulators may also 
be useful for treatment of various testicular disorders, 
15 e.g.. testicular cancer. T139 may serve as an antigen in 
a contraceptive vaccinogen. 

The present invention also provides a diagnostic 
assay for identifying the presence or absence of a 
genetic lesion or mutation characterized by at least one 
20 of: (i) aberrant modification or mutation of a gene 
encoding a T139 protein; (ii) mis -regulation of a gene 
encoding a T139 protein; and (iii) aberrant post- 
translational modification of a T139 protein, wherein a 
wild- type form of the gene encodes a protein with a T139 
25 activity. 

In another aspect, the invention provides a 
method for identifying a compound that binds to or 
modulates the activity of a T139 protein. In general, 
such methods entail measuring a biological activity of a 
30 T139 protein in the presence and absence of a test 

compound and identifying those compounds which alter the 
activity of the T139 protein. 

The invention also features methods for 
identifying a compound which modulates the expression of 
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T139 by measuring the expression of T139 in the presence 
and absence of a compound. 

Other features and advantages of the invention 
will be apparent from the following detailed description 
5 and claims. 

Rr-ipf Descrjp^'-'"" °f Drawings 
Figure 1 depicts the cDNA sequence (SEQ ID N0:1) 
and predicted amino acid sequence (SEQ ID N0:2) of human 
T139 (also referred to as "TANGO 139") . The open reading 
10 frame of SEQ ID N0:1 extends from nucleotide 95 to 
nucleotide 1432 (SEQ ID N0:3) . - 

Figures 2A-2C depict alignments of portions the 
amino acid sequences of T139 with various consensus 
sequences. Figure 2A shows the alignment of T13 9 amino 
15 acids 47 to 190 of SEQ ID NO; 2 with the SCP-like domain 
consensus sequence derived from a hidden Markov model 
(PF00188; SEQ ID N0;9) . Figure 2B shows the alignment of 
T139 amino acids 297 to 412 of SEQ ID N0:2 with the C- 
type lectin domain consensus sequence derived from a 
20 hidden Markov model (PF00059; SEQ ID NO:10). Figure 2C 
shows the alignment of T139 amino acids 232 to 260 (EGFl) 
and 264 to 291 (EGF2) of SEQ ID N0:2 with the EGF-like 
domain consensus sequence derived from a hidden Markov 
model (PF00008; SEQ ID N0:11) . 
25 Figure 3 is a hydropathy plot of T139. The 

position of cysteines (cys) are indicated by the vertical 
bars immediately below the plot. Relative hydrophobicity 
is shown above the dotted line, and relative 
hydrophilicity is shown below the line. 



30 



DPt-.aile^ npacription of the Inven tion 
The present invention is based on the discovery of 

a cDNA molecule encoding human T139. 

A nucleotide sequence encoding a human T139 

protein is shown in Figure 1 (SEQ ID N0:1; SEQ ID NO: 3 
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includes the open reading frame only) . A predicted amino 
acid sequence of T139 protein is also shown in Figure 1 

(SEQ ID NO: 2) . 

The T139 cDNA of Figure 1 (SEQ ID N0:1), which is 
5 approximately 1856 nucleotides long, including 

untranslated regions, encodes a protein amino acid having 
a molecular weight of approximately 49 kDa (excluding 
post-translational modifications) . A plasmid containing 
a cDNA encoding human T139 was deposited with American 
10 Type Culture Collection (ATCC) , Rockville, Maryland on 
March 12, 1998, and assigned Accession Number 98694. 
This deposit will be maintained under the terms of the 
Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent 
15 Procedure. This deposit was made merely as a convenience 
for those of skill in the art and is not an admission 
that a deposit is required under 35 U.S.C. §112. 

sequence analysis revealed that T139 is homologous 
to testis-specific protein-1 (TPX-1) , a member of the 
20 SCP-like domain protein family. Comparison of the T139 
SCP-like domain with the SCP-like domain consensus (SEQ 
ID NO: 9) revealed that the T139 SCP-like domain is 28% 
identical (45/162 amino acids) and 50% similar (81/162 
amino acids) to the consensus. 
25 Alignment of the C-type lectin domain of human 

T139 protein (SEQ ID NO: 6) with the C-type lectin domain 
consensus sequence (SEQ ID NO: 10) revealed that the 
domains are 27% identical (28/103 amino acids) and 63% 
similar (65/103 amino acids) . C-type lectin domains 
30 appear to function as calcium-dependent carbohydrate- 
recognition domains and contain four conserved cysteines. 
The first and fourth cysteines and the second and third 
cysteines in the consensus participate in disulfide 
bonding with each other. One example of a protein having 
35 a C-type lectin domain is the REG protein, a 166 amino 
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acid polypeptide shown to stimulate beta-cell 
regeneration in a adult mouse pancreas. For a review on 
the REG protein, see Baeza et al. (1996) Diab. Metab. 
22:229-234. 

5 Alignment of the EGF-like domains of human T139 

protein (SEQ ID N0S:7 and 8) with the EGF-like domain 
consensus sequence (SEQ ID N0:11) revealed that the EGFl 
domain is 38% identical (13/34 amino acids) and 71% 
similar (24/34 amino acids). In general, EGF-like 

10 domains are found in the extracellular portion of 

membrane-bound proteins or in secreted proteins. EGF- 
like domains typically include six cysteine residues 
involved in disulfide bond formation with two conserved 
glycines between the fifth and sixth cysteine. The 

15 secondary structure of EGF-like domains appears to be a 
two-stranded 6-sheet followed by a loop to a C-terminal 
short two- stranded sheet. 

Tango 139 is expressed at high levels in the 
kidney and at low levels in the testis as an about 2.0 kb 

20 transcript. Additional T139 transcripts of about 2.4 kb 
and 3.5 kb were also present in these two tissues. No 
T139 expression was observed in the heart, brain, 
placenta, lung, liver, skeletal muscle, pancreas, spleen, 
thymus, ovaries, small intestine, colon, and peripheral 

25 blood leukocytes. 

Human T139 is one member of a family of molecules 
(the "T139 family") having certain conserved structural 
and functional features. The term "family" when 
referring to the protein and nucleic acid molecules of 

30 the invention is intended to mean two or more proteins or 
nucleic acid molecules having a common structural domain 
and having sufficient amino acid or nucleotide sequence 
identity as defined herein. Such family members can be 
naturally occurring and can be from either the same or 

35 different species. For example, a family can contain a 
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first protein of human origin and a hotnologue of that 
protein of murine origin, as well as a second, distinct 
protein of human origin and a murine homologue of that 
protein. Members of a family may also have common 
functional characteristics. 

in one embodiment, a T139 protein includes a SCP- 
like, C-type lectin, or EGF-like domain having at least 
about 65%, preferably at least about 75%, and more 
preferably about 85%, 95%, or 98% amino acid sequence 
identity to the SCP-like domain of SEQ ID N0:5, C-type 
lectin domain of SEQ ID NO:6, -or EGF-like domain of SEQ 

ID N0S:7 or 8. 

Preferred T139 polypeptides of the present 
invention have an amino acid sequence sufficiently 
identical to the SCP-like domain of SEQ ID N0:5, C-type 
lectin domain of SEQ ID N0:6, or EGF-like domain of SEQ 
ID N0S:7 or 8. As used herein, the term "sufficiently 
identical" refers to a first amino acid or nucleotide 
sequence which contains a sufficient or minimum number of 
identical or equivalent (e.g., an amino acid residue 
which has a similar side chain) amino acid residues or 
nucleotides to a second amino acid or nucleotide sequence 
such that the first and second amino acid or nucleotide 
sequences have a common structural domain and/or common 
functional activity. For example, amino acid or 
nucleotide sequences which contain a common structural 
domain having about 65% identity, preferably 75% 
identity, more preferably 85%, 95%, or 98% identity are 
defined herein as sufficiently identical. 

AS used interchangeably herein a "T139 activity", 
"biological activity of T139" or "functional activity of 
T139", refers to an activity exerted by a T139 protein, 
polypeptide or nucleic acid molecule on a T139 responsive 
cell as determined in vivo, or in vitro, according to 
35 standard techniques. A T139 activity can be a direct 



25 



30 
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activity, such as an association with or an enzymatic 
activity on a second protein or an indirect activity, 
such as a cellular signaling activity mediated by 
interaction of the T139 protein with a second protein. 
5 " in a preferred embodiment, a T139 activity includes at 
least one or more of the following activities: (i) 
interaction with other proteins; and (ii) interaction 

with a T13 9 receptor. 

Accordingly, another embodiment of the invention 
10 features isolated T139 proteins and polypeptides having a 

T139 activity. 

Yet another embodiment of the invention features 
T13 9 molecules which contain a signal sequence. 
Generally, a signal sequence (or signal peptide) is a 
15 peptide containing about 20 amino acids which occurs at 
the extreme N-terrainal end of secretory and integral 
membrane proteins and which contains large numbers of 
hydrophobic amino acid residues and serves to direct a 
protein containing such a sequence to a lipid bilayer. 

Various aspects of the invention are described in 
further detail in the following subsections. 



20 



T Taolated Nnfl eic A '^if^ Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode T139 proteins or 

25 biologically active portions thereof, as well as nucleic 
acid molecules sufficient for use as hybridization probes 
to identify T139-encoding nucleic acids (e.g., T139 mRNA) 
and fragments for use as PGR primers for the 
amplification or mutation of T139 nucleic acid molecules. 

30 As used herein, the term "nucleic acid molecule" is 

intended to include DNA molecules (e.g., cDNA or genomic 
DNA) and RNA molecules (e.g., mRNA) and analogs of the 
DNA or RNA generated using nucleotide analogs. The 
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nucleic acid molecule can be single -stranded or double - 
stranded, but preferably is double -stranded DNA. 

An "isolated" nucleic acid molecule is one which 
is separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid. 
Preferably, an -isolated" nucleic acid is free of 
sequences (preferably protein encoding sequences) which 
naturally flank the nucleic acid (i.e., sequences located 
at the 5' and 3' ends of the nucleic acid) in the genomic 
I DNA of the organism from which the nucleic acid is 
derived. For exaitqole, in various embodiments, the 
isolated T139 nucleic acid molecule can contain less than 
about 5 kb, 4 kb, 3 kb. 2 kb. 1 kb, 0.5 kb or 0.1 kb of 
nucleotide sequences which naturally flank the nucleic 
5 acid molecule in genomic DNA of the cell from which the 
nucleic acid is derived. Moreover, an "isolated" nucleic 
acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material, or culture 
medium when produced by recombinant techniques, or 
0 substantially free of chemical precursors or other 
chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention, 
e.g.. a nucleic acid molecule having the nucleotide 
sequence of SEQ ID N0:1, SEQ ID NO: 3, or the cDNA of ATCC 
15 98694, or a complement of any of these nucleotide 
sequences, can be isolated using standard molecular 
biology techniques and the sequence information provided 
herein. Using all or portion of the nucleic acid 
sequences of SEQ ID N0:1, SEQ ID NO: 3, or the cDNA of 
30 ATCC 98694 as a hybridization probe, T139 nucleic acid 
molecules can be isolated using standard hybridization 
and cloning techniques (e.g., as described in Sambrook et 
al., eds., Molecular Cloning: A Laboratory Manual. 2nd, 
ed. ', Cold Spring Harbor Laboratory, Cold Spring Harbor 
35 Laboratory Press, Cold Spring Harbor, NY, 1989) . 
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A nucleic acid of the invention can be amplified 
using cDNA, mRNA or genomic DNA as a template and 
appropriate oligonucleotide primers according to standard 
PGR amplification techniques. The nucleic acid so 
5 amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to T139 nucleotide 
sequences can be prepared by standard synthetic 
techniques, e.g., using an automated DNA synthesizer. 
LO In another preferred embodiment, an isolated 

nucleic acid molecule of the invention comprises a 
nucleic acid molecule which is a complement of the 
nucleotide sequence shown in SEQ ID N0:1, SEQ ID NO: 3, or 
the CDNA of ATCC 98694, or a portion thereof. A nucleic 
15 acid molecule which is complementary to a given 
nucleotide sequence is one which is sufficiently 
complementary to the given nucleotide sequence that it 
can hybridize to the given nucleotide sequence thereby 
forming a stable duplex. 
20 Moreover, the nucleic acid molecule of the 

invention can comprise only a portion of a nucleic acid 
sequence encoding T139. for example, a fragment which can 
be used as a probe or primer or a fragment encoding a 
biologically active portion of T139. The nucleotide 
25 sequence determined from the cloning of the human T139 
gene allows for the generation of probes and primers 
designed for use in identifying and/or cloning T139 
homologues in other cell types, e.g., from other tissues, 
as well as T139 homologues from other mammals. The 
30 probe/primer typically comprises substantially purified 
oligonucleotide. The oligonucleotide typically comprises 
a region of nucleotide sequence that hybridizes under 
stringent conditions to at least about 12, preferably 
about 25, more preferably about 50. 75, 100, 125. 150, 
35 175 200. 250, 300. 350 or 400 consecutive nucleotides of 
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the sense or anti-sense sequence of SEQ ID N0:1, SEQ ID 
NO- 3, or the cDNA of ATCC 98694 or of a naturally 
occurring mutant of SEQ ID N0:1. SEQ ID NO: 3. or the cDNA 
of ATCC 98694. 

5 Probes based on the human T139 nucleotide sequence 

can be used to detect transcripts or genomic sequences 
encoding the same or identical proteins. The probe 
comprises a label group attached thereto, e.g., a 
radioisotope, a fluorescent compound, an enzyme, or an 

10 enzyme co-factor. Such probes can be used as a part of a 
diagnostic test kit for -identifying cells or tissue which 
mis-express a T139 protein, such as by measuring a level 
of a T13 9 -encoding nucleic acid in a sample of cells from 
a subject, e.g.. detecting T139 mRNA levels or 

15 determining whether a genomic T139 gene has been mutated 
or deleted. 

A nucleic acid fragment encoding a "biologically 
active portion of T139" can be prepared by isolating a 
portion of SEQ ID N0:1, SEQ ID N0:3, or the nucleotide 
20 sequence of the cDNA of ATCC 98694 which encodes a 

polypeptide having a T139 biological activity, expressing 
the encoded portion of T13 9 protein (e.g., by recombinant 
expression in vitro) and assessing the activity of the 
encoded portion of T139. For example, a nucleic acid 
25 fragment encoding a biologically active portion of T139 
includes a SCP-like. C-type lectin, or EGF-like domain, 
e.g., any of SEQ ID N0S:5-8. 

The invention further encompasses nucleic acid 
molecules that differ from the nucleotide sequence of SEQ 
30 ID NO:l, SEQ ID N0:3, or the cDNA of ATCC 98694 due to 
degeneracy of the genetic code and thus encode the same 
T139 protein as that encoded by the nucleotide sequence 
shown in SEQ ID N0:1, SEQ ID NO: 3, or the cDNA of ATCC 
98694. 
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In addition to the human T139 nucleotide sequence 
shown in SEQ ID N0:1, SEQ ID NO: 3, or the cDNA of ATCC 
98694, it will be appreciated by those skilled in the art 
that DNA sequence polymorphisms that lead to changes in 
5 the amino acid sequences of T139 may exist within a 
population (e.g., the human population). Such genetic 
polymorphism in the T139 gene may exist among individuals 
within a population due to natural allelic variation. As 
used herein, the terms "gene- and "recombinant gene" 
10 refer to nucleic acid molecules comprising an open 
reading frame encoding a--T139 protein, preferably a 
mammalian T139 protein. Such natural allelic variations 
can typically result in 1-5% variance in the nucleotide 
sequence of the T139 gene. Any and all such nucleotide 
15 variations and resulting amino acid polymorphisms in T139 
that are the result of natural allelic variation and that 
do not alter the functional activity of T139 are intended 
to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding T139 
20 proteins from other species (T139 homologues) , which have 
a nucleotide sequence which differs from that of a human 
T13 9, are intended to be within the scope of the 
invention. Nucleic acid molecules corresponding to 
natural allelic variants and homologues of the T139 cDNA 
25 of the invention can be isolated based on their identity 
to the human T139 nucleic acids disclosed herein using 
the human cDNAs. or a portion thereof, as a hybridization 
probe according to standard hybridization techniques 
under stringent hybridization conditions. For example, a 
30 membrane-bound human T139 cDNA can be isolated based on 
its identity to soluble T139. 

Accordingly, in another embodiment, an isolated 
nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
35 800, 900, 1000, or 1290) nucleotides in length and 
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hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence, preferably 
the coding sequence, of SEQ ID NO:l, SBQ ID NO: 3, or the 

CDNA of ATCC 98694. 
5 AS used herein, the term "hybridizes under 

stringent conditions" is intended to describe conditions 
for hybridization and washing under which nucleotide 
sequences at least 60% (65%, 70%, preferably 75%) 
identical to each other typically remain hybridized to 
10 each other. Such stringent conditions are known to those 
skilled in the art and-can be found in Current Protocols 
in Molecular Biology, John Wiley & Sons, N.Y. (1989). 
6 3 1-6.3.6. A preferred, non-limiting example of 
stringent hybridization conditions are hybridization in 
15 6X sodium chloride/sodium citrate (SSC) at about 45oc, 
followed by one or more washes in 0.2 X SSC, 0.1% SDS at 
50-65»C. Preferably, an isolated nucleic acid molecule 
of the invention that hybridizes under stringent 
conditions to the sequence of SEQ ID N0:1, SEQ ID NO -.3, 
20 the cDNA of ATCC 98694 corresponds to a naturally- 
occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an 
RNA or DNA molecule having a nucleotide sequence that 
occurs in nature (e.g., encodes a natural protein). 
25 In addition to naturally- occurring allelic 

variants of the T139 sequence that may exist in the 
population, the skilled artisan will further appreciate 
that changes can be introduced by mutation into the 
nucleotide sequence of SEQ ID N0:1, SEQ ID N0:3, the cDNA 
30 of ATCC 98694, thereby leading to changes in the ammo 
acid sequence of the encoded T139 protein, without 
altering the functional ability of the T139 protein. For 
example, one can make nucleotide substitutions leading to 
amino acid substitutions at "non-essential" amino acid 
35 residues. A "non-essential" amino acid residue is a 
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residue that can be altered from the wild-type sequence 
of T139 (e.g., the sequence of SEQ ID N0:2) without 
altering the biological activity, whereas an "essential" 
amino acid residue is required for biological activity. 
5 For example, amino acid residues that are conserved among 
the T139 proteins of various species are predicted to be 
particularly unamenable to alteration. 

For example, preferred T139 proteins of the 
present invention, contain at least one SCP-like, C-type 
10 lectin, or EGF-like domain. Such conserved domains are 
less likely to be amenable to mutation. Other amino acid 
residues, however, (e.g., those that are not conserved or 
only semi-conserved among T139 of various species) may 
not be essential for activity and thus are likely to be 
15 amenable to alteration. 

Accordingly, another aspect of the invention 
pertains to nucleic acid molecules encoding T139 proteins 
that contain changes in amino acid residues that are not 
essential for activity. Such T139 proteins differ in 
20 amino acid sequence from SEQ ID N0:2 yet retain 

biological activity. In one embodiment, the isolated 
nucleic acid molecule includes a nucleotide sequence 
encoding a protein that includes an amino acid sequence 
that is at least about 45% identical, 65%, 75%, 85%, 95%, 
25 or 98% identical to the amino acid sequence of SEQ ID 
NO: 2. 

An isolated nucleic acid molecule encoding a T139 
protein having a sequence which differs from that of SEQ 
ID NO: 2 can be created by introducing one or more 

30 nucleotide substitutions, additions or deletions into the 
nucleotide sequence of SEQ ID N0:1, SEQ ID N0:3, the cDNA 
of ATCC 98694 such that one or more amino acid 
substitutions, additions or deletions are introduced into 
the encoded protein. Mutations can be introduced by 

35 standard techniques, such as site-directed mutagenesis 
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and PCR-mediated mutagenesis. Preferably, conservative 
amino acid substitutions are made at one or more 
predicted non-essential amino acid residues. A 
"conservative amino acid substitution" is one in which 
5 the amino acid residue is replaced with an amino acid 
residue having a similar side chain. Families of amino 
acid residues having similar side chains have been 
defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, 
10 histidine), acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains {e.g., 
glycine, asparagine, glutamine, serine, threonine, 
tyrosine, cysteine), nonpolar side chains (e.g.. alanine, 
valine, leucine, isoleucine. proline, phenylalanine, 
15 methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine). 
Thus, a predicted nonessential amino acid residue in T139 
is preferably replaced with another amino acid residue 
20 from the same side chain family. Alternatively, 

mutations can be introduced randomly along all or part of 
a T139 coding sequence, such as by saturation 
mutagenesis, and the resultant mutants can be screened 
for T139 biological activity to identify mutants that 
25 retain activity. Following mutagenesis, the encoded 
protein can be expressed recorabinantly and the activity 
of the protein can be determined. 

In a preferred embodiment, a mutant T139 protein 
can be assayed for: (1) the ability to form 
30 protein:protein interactions with proteins; or (2) the 
ability to bind a T139 receptor. In yet another 
preferred embodiment, a mutant T139 can be assayed for 
the ability to modulate cellular proliferation or 
cellular differentiation. 
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The present invention encompasses antisense 
nucleic acid molecules, i.e., molecules which are 
complementary to a sense nucleic acid encoding a protein, 
e.g., complementary to the coding strand of a double- 
5 stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 
nucleic acid can be complementary to an entire T139 
coding strand, or to only a portion thereof, e.g., all or 
10 part of the protein coding region (or open reading 
frame) . An antisense- -nucleic acid molecule can be 
antisense to a noncoding region of the coding strand of a 
nucleotide sequence encoding T139. The noncoding regions 
("5' and 3' untranslated regions") are the 5' and 3' 
15 sequences which flank the coding region and are not 
translated into amino acids. 

Given the coding strand sequences encoding T139 
disclosed herein (e.g., SEQ ID N0:1 or SEQ ID N0:3) , 
antisense nucleic acids of the invention can be designed 
20 according to the rules of Watson and Crick base pairing. 
The antisense nucleic acid molecule can be complementary 
to the entire coding region of T139 mRNA, but more 
preferably is an oligonucleotide which is antisense to 
only a portion of the coding or noncoding region of T139 
25 mRNA. For example, the antisense oligonucleotide can be 
complementary to the region surrounding the translation 
start site of T139 mRNA, e.g.. an oligonucleotide having 
the sequence CTCTGGATGCAGCATGGGTCTGTTGGGCCC (SEQ ID 
NO -.12) or GATGCAGCATGGGTCTGTTG (SEQ ID NO: 13) . An 
30 antisense oligonucleotide can be, for example, about 5, 
10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in 
length'. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 
35 For example, an antisense nucleic acid (e.g., an 
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antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 
modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 
5 stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 
Examples of modified nucleotides which can be used to 
generate the antisense nucleic acid include 5- 
10 f luorouracil, 5-bromouracil , 5-chlorouracil, 5- 

iodouracil, hypoxanfehine , xanthine, 4- acetyl cytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl - 2 - thiouridine , 5- 
carboxymethylaminomethyluracil, dihydrouracil , beta-D- 
15 galactosyl queosine, inosine, N6-isopentenyl adenine, 1- 
methylguanine, 1-methylinosine, 2, 2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3 -methyl cytosine, 5- 
methyl cytosine, N6- adenine, 7-methylguanine, 5- 
methylaminomethyluracil , 5 -methoxyaminomethyl - 2 - 
20 thiouracil, beta-D-mannosylqueosine, 5'- 

methoxycarboxymethyluracil , 5-methoxyuracil , 2 - 
methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid 
(v) , wybutoxosine, pseudouracil, queosine, 2- 
thiocytosine, 5 -methyl -2 -thiouracil, 2-thiouracil, 4- 
25 thiouracil, 5-methyluracil, uracil- 5 -oxyacetic acid 
methylester, uracil-5-oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- (3-amino-3-N-2-carboxypropyl) uracil, 
(acp3)w, and 2 , 6-diaminopurine- Alternatively, the 
antisense nucleic acid can be produced biologically using 
30 an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of 
interest, described further in the following subsection). 
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The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 
generated in situ such that they hybridize with or bind 
to cellular mRNA and/or genomic DNA encoding a T139 
5 protein to thereby inhibit expression of the protein, 
e.g., by inhibiting transcription and/or translation. 
The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, 
in the case of an antisense nucleic acid molecule which 
10 binds to DNA duplexes, through specific interactions in 
the major groove of' the double helix. An example of a 
route of administration of antisense nucleic acid 
molecules of the invention include direct injection at a 
tissue site. Alternatively, antisense nucleic acid 
15 molecules can be modified to target selected cells and 
then administered systemically. For example, for 
systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or 
antigens esqjressed on a selected cell surface, e.g., by 
20 linking the antisense nucleic acid molecules to peptides 
or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the 
25 antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 
preferred. 

An antisense nucleic acid molecule of the 
30 invention can be an a-anomeric nucleic acid molecule. An 
a-anomeric nucleic acid molecule forms specific do\able- 
stranded hybrids with complementary RNA in which, 
contrary to the usual ^ -units, the strands run parallel 
to each other (Gaultier et al. (1987) Nucleic Acids. Res. 
35 15:6625-6641). The antisense nucleic acid molecule can 
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also comprise a 2' -o-methylribonucleotide (Inoue et al. 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 
RNA-DNA analogue (Inoue et al. (1987) FEES Lett. 215:327- 
330) . 

5 The invention also encompasses ribozymes. 

Ribozymes are catalytic RNA molecules with ribonuclease 
activity which are capable of cleaving a single-stranded 
nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., hammerhead 
10 ribozymes (described in Haselhoff and Gerlach (1988) 

Wature 334 :585-591) )- can be used to catalytically cleave 
T139 mRNA transcripts to thereby inhibit translation of 
T139 mRNA. A ribozyme having specificity for a T139- 
encoding nucleic acid can be designed based upon the 
15 nucleotide sequence of a T139 cDNA disclosed herein 
(e.g., SEQ ID N0:1, SEQ ID N0:3) . For example, a 
derivative of a Tetrahymena L-19 IVS RNA can be 
constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence 
20 to be cleaved in a T139-encoding mRNA. See, e.g., Cech 
et al. U.S. Patent No. 4,987,071; and Cech et al. U.S. 
Patent No. 5,116.742. Alternatively, T139 mRNA can be 
used to select a catalytic RNA having a specific 
ribonuclease activity from a pool of RNA molecules. See, 
25 e.g., Bartel and Szostak (1993) Science 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 
example, T139 gene expression can be inhibited by 
targeting nucleotide sequences complementary to the 
30 regulatory region of the T139 (e.g., the T139 promoter 
and/or enhancers) to form triple helical structures that 
prevent transcription of the T139 gene in target cells, 
see generally, Helene (1991) Anticancer Drug Des. 
6(6):569-84; Helene (1992) Ann. N.Y. Acad. Sci. 660:27- 
35 36; and Maher (1992) Bioassays 14 (12) : 807-15 . 
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In preferred embodiments, the nucleic acid 
molecules of the invention can be modified at the base 
moiety, sugar moiety or phosphate baclcbone to improve, 
e.g., the stability, hybridization, or solubility of the 
5 molecule. For example, the deoxyribose phosphate 

backbone of the nucleic acids can be modified to generate 
peptide nucleic acids (see Hyrup et al. (1996) Bioorganic 
& Medicinal Chemistry 4 (l):5-23). As used herein, the 
terms "peptide nucleic acids" or "PNAs" refer to nucleic 
10 acid mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under 
15 conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al. 
(1996) supra; Perry-0' Keef e et al . (1996) Proc. Natl. 
Acad. Sci. USA 93:14670-675. 
20 PNAs of T13 9 can be used therapeutic and 

diagnostic applications. For example, PNAs can be used 
as antisense or antigene agents for sequence -specific 
modulation of gene expression by, e.g., inducing 
transcription or translation arrest or inhibiting 
25 replication. PNAs of T139 can also be used, e.g., in the 
analysis of single base pair mutations in a gene by, 
e.g., PNA directed PGR clamping? as 'artificial 
restriction enzymes when used in combination with other 
enzymes, e.g., SI nucleases (Hyrup (1996) supra) or as 
3 0 probes or primers for DNA sequence and hybridization 
(Hyrup (1996) supra; Perry-0' Keef e et al . (1996) Proc. 
Natl. Acad. Sci. USA 93:14670-675). 

In another embodiment, PNAs of T139 can be 
modified, e.g., to enhance their stability or cellular 
35 uptake, by attaching lipophilic or other helper groups to 
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PNA. by the formation of PNA-DNA chimeras, or by the use 
of liposomes or other techniques of drug delivery known 
in the art. For example, PNA-DNA chimeras of T13 9 can be 
generated which may combine the advantageous properties 

5 of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNAse H and DNA polymerases, to interact 
with the DNA portion while the PNA portion would provide 
high binding affinity and specificity. PNA-DNA chimeras 
can be linked using linkers of appropriate lengths 
10 selected in terms of base stacking, number of bonds 
between the nucleobases, and orientation (Hyrup (1996) 
supra) . The synthesis of PNA-DNA chimeras can be 
performed as described in Hyrup (1996) supra and Finn et 
al. (1996) Nucleic Acids Research 24 (17) : 3357-63 . For 

15 example, a DNA chain can be synthesized on a solid 

support using standard phosphoramidite coupling chemistry 
and modified nucleoside analogs, e.g., 5'-(4- 
methoxytrityl)amino-5' -deoxy- thymidine phosphoramidite, 
can be used as a between the PNA and the 5' end of DNA 

20 (Magetal. (1989) Nucleic Acid J?es . 17 : 5973-88) . PNA 
monomers are then coupled in a stepwise manner to produce 
a chimeric molecule with a 5' PNA segment and a 3' DNA 
segment (Finn et al. (1996) Nucleic Acids Research 
24(17) :3357-63) . Alternatively, chimeric molecules can 

25 be synthesized with a 5' DNA segment and a 3' PNA segment 
(Peterser et al. (1975) Bioorganic Med. Chem. Lett. 

5:1119-11124) . 

In other embodiments, the oligonucleotide may 
include other appended groups such as peptides (e.g., for 

30 targeting host cell receptors in vivo) . or agents 

facilitating transport across the cell membrane (see, 
e.g., Letsinger et al . (198$) Proc. Natl. Acad. Sci. USA 
86:6553-6556; Lemaitre et al . (1987) Proc. Natl. Acad. 
Sci. USA 84:648-652; PCT Publication No. W088/09810) or 

35 the blood-brain barrier (see, e.g., PCT Publication No. 
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W089/10134). In addition, oligonucleotides can be 
modified with hybridization-triggered cleavage agents 
(see e.g., Krol et al. (1988) Bio/Techniques 6:958-976) 
or intercalating agents (see, e.g., Zon (1988) Pharm. 
5 Res. 5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport 
agent, hybridization- triggered cleavage agent, etc. 

TT T--'-^'="^ '^•'•^9 Pro ^^-i^« ^"d Anti-T1^9 Antibodies 
10 One aspect of the invention pertains to isolated 

T139 proteins, and biologically active portions thereof, 
as well as polypeptide fragments suitable for use as 
immunogens to raise anti-T139 antibodies. In one 
embodiment, native T139 proteins can be isolated from 

15 cells or tissue sources by an appropriate purification 
scheme using standard protein purification techniques, 
in another embodiment, T139 proteins are produced by 
recombinant DNA techniques. Alternative to recombinant 
expression, a T139 protein or polypeptide can be 

20 synthesized chemically using standard peptide synthesis 

techniques . 

An "isolated" or "purified" protein or 
biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from 
25 the cell or tissue source from which the T139 protein is 
derived, or substantially free from chemical precursors 
or other chemicals when chemically synthesized. The 
language "substantially free of cellular material" 
includes preparations of T139 protein in which the 
30 protein is separated from cellular components of the 
cells from which it is isolated or recombinantly 
produced. Thus, T139 protein that is substantially free 
of cellular material includes preparations of T13 9 
protein having less than about 30%, 20%, 10%, or 5% (by 
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dry weight) of non-T139 protein (also referred to herein 
as a "contaminating protein") . When the T139 protein or 
biologically active portion thereof is recombinantly 
produced, it is also preferably substantially free of 
5 culture medium, i.e., culture medium represents less than 
about 20%, 10%, or 5% of the volume of the protein 
preparation. When T139 protein is produced by chemical 
synthesis, it is preferably substantially free of 
chemical precursors or other chemicals, i.e., it is 
10 separated from chemical precursors or other chemicals 
which are involved in the synthesis of the protein. 
Accordingly such preparations of T13 9 protein have less 
than about 30%, 20%, 10%. 5% (by dry weight) of chemical 
precursors or non-T13 9 chemicals. 
15 Biologically active portions of a T139 protein 

include peptides comprising amino acid sequences 
sufficiently identical to or derived from the amino acid 
sequence of the T139 protein (e.g., the amino acid 
sequence shown in SEQ ID NO: 2 or SEQ ID NO: 4) , which 
20 include less amino acids than the full length T139 
proteins, and exhibit at least one activity of a T139 
protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of 
the T139 protein. A biologically active portion of a 
25 T139 protein can be a polypeptide which is, for example, 
10, 25, 50, 100 or more amino acids in length. Preferred 
biologically active polypeptides include one or more 
identified T139 structural domains, e.g., SCP-like domain 
(SEQ ID N0:5), C-type lectin domain (SEQ ID N0:6), or one 
30 of the EGF-like domains (SEQ ID NO: 7 and 8). 

Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one 
or more of the functional activities of a native T139 
35 protein. Preferred T139 protein has the amino acid 
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sequence shown of SEQ ID NO: 2. Other useful T139 
proteins are substantially identical to SEQ ID NO: 2 and 
retain the functional activity of the protein of SEQ ID 
NO: 2 yet differ in amino acid sequence due to natural 
5 allelic variation or mutagenesis. Accordingly, a useful 
T139 protein is a protein which includes an amino acid 
sequence at least about 45%, preferably 55%, 65%, 75%, 
85%, 95%, or 99% identical to the amino acid sequence of 
SEQ ID NO: 2 and retains the functional activity of the 

10 T139 proteins of SEQ ID N0:2. In other instances, the 
T13 9 protein is a protein having an amino acid sequence 
55%, 65%, 75%, 85%, 95%, or 98% identical to the SCP- 
like, C-type lectin, or EGF-like domain (any one of SEQ 
ID NOS:5-8). In a preferred embodiment, the T139 protein 

15 retains the functional activity of the T139 protein of 

SEQ ID NO:2. 

To determine the percent identity of two amino 
acid sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
20 be introduced in the sequence of a first amino acid or 
nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 
or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
25 in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are identical at 
that position. The percent identity between the two 
sequences is a function of the number of identical 
30 positions shared by the sequences (i.e., % identity = # 
of identical positions/total # of positions x 100) . 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
algorithm. A preferred, non- limiting example of a 
35 mathematical algorithm utilized for the comparison of two 
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sequences is the algorithm of Karlin and Altschul (1990) 
Proc. Natl, Acad. Sci. USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 
90:5873-5877. Such an algorithm is incorporated into the 
5 NBLAST and XBLAST programs of Altschul et al. (1990) J. 
Mol. Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous 
to T139 nucleic acid molecules of the invention. BLAST 

10 protein searches can be performed with the XBLAST 

program, score =50, wordlength = 3 to obtain amino acid 
sequences homologous to T139 protein molecules of the 
invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in 

15 Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. 
When utilizing BLAST and Gapped BLAST programs, the 
default parameters of the respective programs (e.g., 
XBLAST and NBLAST) can be used. See 

http://www.ncbi.nlm.nih.gov. Another preferred, non- 
20 limiting example of a mathematical algorithm utilized for 
the comparison of sequences is the algorithm of Myers and 
Miller, CABIOS (1989) . Such an algorithm is incorporated 
into the ALIGN program (version 2.0) which is part of the 
GCG sequence alignment software package. When utilizing 
25 the ALIGN program for comparing amino acid sequences, a 
PAM120 weight residue table, a gap length penalty of 12, 
and a gap penalty of 4 can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
30 above, with or without allowing gaps. In calculating 
percent identity, only exact matches are counted. 

The invention also provides T139. chimeric or 
fusion proteins. As used herein, a T139 "chimeric 
protein" or "fusion protein" comprises a T139 polypeptide 
35 operatively linked to a non-T139 polypeptide. A "T139 
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polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to T139. whereas a "non-T139 
polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a protein which is not 
5 substantially identical to the T139 protein, e.g., a 
protein which is different from the T139 protein and 
which is derived from the same or a different organism. 
Within a T139 fusion protein the T139 polypeptide can 
correspond to all or a portion of a T139 protein, 

10 preferably at least one biologically active portion of a 
T139 protein. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the 
T139 polypeptide and the non-T139 polypeptide are fused 
in- frame to each other. The non-T139 polypeptide can be 

15 fused to the N-terminus or C-terminus of the T139 

polypeptide . 

One useful fusion protein is a GST-T139 fusion 
protein in which the T139 sequences are fused to the C- 
terminus of the GST sequences. Such fusion proteins can 
20 facilitate the purification of recombinant T139. 

In another embodiment, the fusion protein is a 
T139 protein containing a heterologous signal sequence at 
its N-terminus. For example, the native T139 signal 
sequence (i.e.. about amino acids 1 to 22 of SEQ ID N0:2) 
25 can be removed and replaced with a signal sequence from 
another protein. In certain host cells (e.g., mammalian 
host cells) , expression and/or secretion of T139 can be 
increased through use of a heterologous signal sequence. 
For exaitple, the gp67 secretory sequence of the 
30 baculovirus envelope protein can be used as a 

heterologous signal sequence (Current Protocols in 
Molecular Biology. Ausubel et al., eds., John Wiley & 
Sons, 1992) . Other examples of eukaryotic heterologous 
signal sequences include the secretory sequences of 
35 melittin and human placental alkaline phosphatase 
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(Stratagene; La Jolla, California). In yet another 
exanple. useful prokaryotic heterologous signal sequences 
include the phoA secretory signal (Molecular cloning. 
Sambrook et al, second edition. Cold spring harbor 
5 laboratory press, 1989) and the protein A secretory 
signal (Pharmacia Biotech; Piscataway, New Jersey) . 

in yet another embodiment, the fusion protein is 
an T139-iramunoglobulin fusion protein in which all or 
part of T139 is fused to sequences derived from a member 
10 of the immunoglobulin protein family- (see, e.g., PCT 
publication Number WO 88/07087 and Aruffo et al . (1990) 
Cell 61:1303-1313). The T139-immunoglobulin fusion 
proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject 
15 to inhibit an interaction between a T139 receptor and a 
T139 protein on the surface of a cell, to thereby 
suppxess T139-mediated signal transduction in vivo. 
inhibition of the T139 ligand/T139 interaction may be 
useful therapeutically for both the treatment of 
20 proliferative and dif ferentiative disorders, as well as 
modulating (e.g. promoting or inhibiting) cell survival. 
Moreover, the T13 9 -immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-T139 
antibodies in a subject, to purify T139 receptors and m 
25 screening assays to identify molecules which inhibit the 
interaction of T139 with its receptor. 

Preferably, a T139 chimeric or fusion protein of 
the invention is produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the 
30 different polypeptide sequences are ligated together in- 
frame in accordance with conventional techniques, for 
example by employing blunt-ended or stagger-ended termini 
for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as 
35 appropriate, alkaline phosphatase treatment to avoid 
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undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA 
synthesizers. Alternatively, PGR amplification of gene 
5 fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be 
annealed and reamplified to generate a chimeric gene 
sequence (see, e.g.. Current Protocols in Molecular 
10 Biology. Ausubel ef al. eds., John Wiley & Sons: 1992). 
Moreover, many-expression vectors are commercially 
available that already encode a fusion moiety (e.g., a 
GST polypeptide) . An T139-encoding nucleic acid can be 
cloned into such an expression vector such that the 
15 fusion moiety is linked in-frame to the T139 protein. 

The present invention also pertains to variants of 
the T139 proteins which function as either T139 agonists 
(mimetics) or as T139 antagonists. Variants of the T139 
protein can be generated by mutagenesis, e.g., discrete 
20 point mutation or truncation of the T139 protein. An 
agonist of the T139 protein can retain substantially the 
same, or a subset, of the biological activities of the 
naturally occurring form of the T139 protein. An 
antagonist of the T139 protein can inhibit one or more of 
25 the activities of the naturally occurring form of the 
T139 protein by, for example, con^^etitively binding to a 
downstream or upstream member of a cellular signaling 
cascade which includes the T139 protein. Thus, specific 
biological effects can be elicited by treatment with a 
30 variant of limited function. Treatment of a subject with 
a variant having a subset of the biological activities of 
the naturally occurring form of the protein can have 
fewer side effects in a subject relative to treatment 
with the naturally occurring form of the T139 proteins. 
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Variants of the T139 protein which function as 
either T139 agonists (mimetics) or as T139 antagonists 
can be identified by screening combinatorial libraries of 
mutants, e.g., truncation mutants, of the T139 protein 
5 for T139 protein agonist or antagonist activity. In one 
embodiment, a variegated library of T139 variants is 
generated by combinatorial mutagenesis at the nucleic 
acid level and is encoded by a variegated gene library. 
A variegated library of T139 variants can be produced by. 
10 for example, enzymatically ligating a mixture of 

synthetic oligonucleotides into gene sequences such that 
a degenerate set of potential T139 sequences is 
expressible as individual polypeptides, or alternatively, 
as a set of larger fusion proteins (e.g., for phage 
15 display) containing the set of T139 sequences therein. 
There are a variety of methods which can be used to 
produce libraries of potential T139 variants from a 
degenerate oligonucleotide sequence. Chemical synthesis 
of a degenerate gene sequence can be performed in an 
20 automatic DNA synthesizer, and the synthetic gene then 
ligated into an appropriate expression vector. Use of a 
degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set 
of potential T139 sequences. Methods for synthesizing 
25 degenerate oligonucleotides are known in the art (see, 
e.g., Narang (1983) Tetrahedron 39:3; Itakura et al. 
(1984) Aimu. Rev. Biochem. 53:323; Itakura et al. (1984) 
Science 198:1056; Ike et al . (1983) Nucleic Acid J?es. 
11:477). 

30 In addition, libraries of fragments of the T139 

protein coding sequence can be used to generate a 
variegated population of T139 fragments for screening and 
subsequent selection of variants of a T139 protein. In 
one embodiment, a library of coding sequence fragments 

35 can be generated by treating a double stranded PGR 
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fragment of a T139 coding sequence with a nuclease under 
conditions wherein nicking occurs only about once per 
molecule, denaturing the double stranded DNA, renaturing 
the DNA to form double stranded DNA which can include 
5 sense/ant isense pairs from different nicked products, 
removing single stranded portions from reformed duplexes 
by treatment with SI nuclease, and ligating the resulting 
fragment library into an expression vector. By this 
method, an expression library can be derived which 
10 encodes N-terminal and internal fragments of various 
sizes of the T139 protein. 

Several techniques are known in the art for 
screening gene products of combinatorial libraries made 
by point mutations or truncation, and for screening cDNA 
15 libraries for gene products having a selected property. 
Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis 
of T139 proteins. The most widely used techniques, which 
are amenable to high through-put analysis, for screening 
20 large gene libraries typically include cloning the gene 
library into replicable expression vectors, transforming 
appropriate cells with the resulting library of vectors, 
and expressing the combinatorial genes under conditions 
in which detection of a desired activity facilitates 
25 isolation of the vector encoding the gene whose product 
was detected. Recursive ensemble mutagenesis (REM) , a 
technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with 
the screening assays to identify T139 variants (Arkin and 
30 Yourvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815; 
Delgrave et al. (1993) Protein jSngineering 6(3) =327-331) - 
An isolated T139 protein, or a portion or fragment 
thereof, can be used as an immunogen to generate 
antibodies that bind T139 using standard techniques for 
35 polyclonal and monoclonal antibody preparation. The 
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full-length T139 protein can be used or, alternatively, 
the invention provides antigenic peptide fragments of 
T139 for use as immunogens. The antigenic peptide of 
T139 comprises at least 8 (preferably 10. 15, 20, or 30) 
5 amino acid residues of the amino acid sequence shown in 
SEQ ID NO: 2 and encompasses an epitope of T139 such that 
an antibody raised against the peptide forms a specific 
immune complex with T139. 

Preferred epitopes encompassed by the antigenic 

10 peptide are regions of T139 that are located on the 
surface of the protein, e.g., hydrophilic regions. A 
hydrophobicity analysis of the human T139 protein 
sequence indicates that the regions between, e.g., amino 
acids 125 and 140, between amino acids 265 and 280, and 

15 between amino acids 315 and 335 of SEQ ID NO: 2 are 

particularly hydrophilic and, therefore, are likely to 
encode surface residues useful for targeting antibody 
product ion . 

A T139 immunogen typically is used to prepare 
20 antibodies by immunizing a suitable subject, (e.g., 

rabbit, goat, mouse or other mammal) with the immunogen. 
An appropriate immunogenic preparation can contain, for 
example, recombinantly expressed T139 protein or a 
chemically synthesized T139 polypeptide. The preparation 
25 can further include an adjuvant, such as Freund' s 
complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable 
subject with an immunogenic T139 preparation induces a 
polyclonal anti-T139 antibody response. 
30 Accordingly, another aspect of the invention 

pertains to anti-T139 antibodies. The term "antibody as 
used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antigen 
35 binding site which specifically binds an antigen, such as 
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T139. A molecule which specifically binds to T139 is a 
molecule which binds T139. but does not substantially 
bind other molecules in a sample, e.g.. a biological 
sample, which naturally contains T139. Examples of 
5 immunologically active portions of immunoglobulin 

molecules include F(ab) and FCab')^ fragments which can be 
generated by treating the antibody with an enzyme such as 
pepsin. The invention provides polyclonal and monoclonal 
antibodies that bind T139. The term "monoclonal 
10 antibody" or "monoclonal antibody composition", as used 
herein, refers to a population of antibody molecules that 
contain only one species of an antigen binding site 
capable of immunoreacting with a particular epitope of 
T139. A monoclonal antibody composition thus typically 
15 displays a single binding affinity for a particular T139 
protein with which it immunoreacts . 

Polyclonal anti-T139 antibodies can be prepared as 
described above by immunizing a suitable subject with a 
T139 immunogen. The anti-T139 antibody titer in the 
20 immunized subject can be monitored over time by standard 
techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized T13 9 . If desired, the 
antibody molecules directed against T139 can be isolated 
from the mammal (e.g., from the blood) and further 
25 purified by well-known techniques, such as protein A 
chromatography to obtain the IgG fraction. At an 
appropriate time after immunization, e.g., when the anti- 
T139 antibody titers are highest, antibody -producing 
cells can be obtained from the subject and used to 
30 prepare monoclonal antibodies by standard techniques, 
such as the hybridoma technique originally described by 
Kohler and Milstein (1975) Nature 256:495-497, the human 
B cell hybridoma technique (Kozbor et al . (1983) Immunol 
Today 4:72), the EBV- hybridoma technique (Cole et al. 
35 (1985), Monoclonal Antibodies and Cancer Therapy, Alan R. 



W099/S4343 



PCT/US99/08896 



- 37 - 

Liss. Inc., pp. 77-96) or trioma techniques. The 
technology for producing various antibodies monoclonal 
antibody hybridomas is well knovm (see, e.g., Current 
Protocols in Iimunology (1994) Coligan et al. (eds.) John 
5 Wiley & sons. Inc., New York, NY). Briefly, an immortal 
cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with a 
T139 immunogen as described above, and the culture 
supematants of the resulting hybridoma cells are 
10 screened to identify a hybridoma producing a monoclonal 
antibody that binds T139. 

Any of the many well known protocols used for 
fusing lymphocytes and immortalized cell lines can be 
applied for the purpose of generating an anti-T139 
15 monoclonal antibody (see, e.g.. Current Protocols in 

immunology, supra; Galfre et al. (1977) Nature 266:55052; 
R.H. Kenneth, in Monoclonal Antibodies: A New Dimension 
In Biological Analyses. Plenum Publishing Corp., New 
York, New York (1980); and Lemer (1981) Yale J. Biol. 
20 Med., 54:387-402. Moreover, the ordinarily skilled 

worker will appreciate that there are many variations of 
such methods which also would be useful. Typically, the 
immortal cell line (e.g., a myeloma cell line) is derived 
from the same mammalian species as the lymphocytes. For 
25 example, murine hybridomas can be made by fusing 

lymphocytes from a mouse immunized with an immunogenic 
preparation of the present invention with an immortalized 
mouse cell line, e.g., a myeloma cell line that is 
sensitive to culture medium containing hypoxanthine , 
30 aminopterin and thymidine ("HAT medium")- Any of a 
number of myeloma cell lines can be used as a fusion 
partner according to standard techniques, e.g., the P3- 
NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/0-Agl4 myeloma lines. 
These myeloma lines are available from ATCC. Typically, 
35 HAT-sensitive mouse myeloma cells are fused to mouse 
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splenocytes using polyethylene glycol ("PEG"). Hybridoma 
cells resulting from the fusion are then selected using 
HAT medium, which kills unfused and unproductively fused 
myeloma cells (unfused splenocytes die after several days 
5 because they are not transformed) . Hybridoma cells 
producing a monoclonal antibody of the invention are 
detected by screening the hybridoma culture supematants 
for antibodies that bind T139, e.g., using a standard 
ELISA assay. 

10 Alternative to preparing monoclonal antibody- 

secreting hybridomas, a monoclonal anti-T139 antibody can 
be identified and isolated by screening a recombinant 
combinatorial immunoglobulin library (e.g., an antibody 
phage display library) with T139 to thereby isolate 
15 immunoglobulin library members that bind T139. Kits for 
generating and screening phage display libraries are 
commercially available (e.g., the Pharmacia Recombinant 
Phage Antibody System. Catalog No. 27-9400-01; and the 
Stratagene Surf ZAP™ Phage Display Kit. Catalog No. 
20 240612) . Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening 
antibody display library can be found in, for example, 
U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 
25 Publication WO 92/20791; PCT Publication No. WO 92/15679; 
PCT Publication WO 93/01288; PCT Publication No. WO 
92/01047;. PCT Publication No. WO 92/09690; PCT 
Publication No. WO 90/02809; Fuchs et al . (1991) 
Bio/Technology 9:1370-1372; Hay et al. (1992) Hum. 
30 Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 
246:1275-1281; Griffiths et al. (1993) EMBO J 12 : 725-734 . 

Additionally, recombinant anti-T139 antibodies, 
such as chimeric and humanized monoclonal antibodies, 
comprising both human and non-human portions, which can 
35 be made using standard recombinant DNA techniques, are 
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within the scope of the invention. Such chimeric and 
humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art, for example 
using methods described in PCT Publication No. WO 
5 87/02671; European Patent Application 184,187; European 
Patent Application 171,496; European Patent Application 
173,494; PCT Publication No. WO 86/01533; U.S. Patent No. 
4,816,567; European Patent Application 125,023; Better et 
al. (1988) Science 240:1041-1043; Liu et al . (1987) Proc. 

10 Natl. Acad. Sci. USA 84:3439-3443; Liu et al . (1987) J. 
Immunol. 139:3521-3526; Sun et al . (1987) Proc. Watl. 
Acad. Sci. USA 84:214-218; Nishimura et al . (1987) Cane. 
Res. 47:999-1005; Wood et al . (1985) Nature 314:446-449; 
and Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553- 

15 1559); Morrison, (1985) Science 229:1202-1207; Oi et al. 
(1986) Bio/Techniques 4:214; U.S. Patent 5,225.539; Jones 
et al. (1986) Nature 321:552-525; Verhoeyan et al. (1988) 
Science 239:1534; and Beidler et al. (1988) J. Immunol. 
141:4053-4060. 

20 An anti-T139 antibody (e.g., monoclonal antibody) 

can be used to isolate T139 by standard techniques, such 
as affinity chromatography or immunoprecipitation. An 
anti-T139 antibody can facilitate the purification of 
natural T139 from cells and of recombinantly produced 
25 T139 expressed in host cells. Moreover, an anti-T139 
antibody can be used to detect T139 protein (e.g., in a 
cellular lysate or cell supernatant) in order to evaluate 
the abundance and pattern of expression of the T139 
protein. Anti-T139 antibodies can be used diagnostically 

30 to monitor protein levels in tissue as part of a clinical 
testing procedure, e.g., to, for example, determine the 
efficacy of a given treatment regimen. Detection can be 
facilitated by coupling the antibody to a detectable 
substance. Examples of detectable substances include 

35 various enzymes, prosthetic groups, fluorescent 
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materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of 
suitable enzymes include horseradish peroxidase, alkaline 
phosphatase, jS-galactosidase, or acetylcholinesterase; 
5 examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbel 1 if erone, 
fluorescein, fluorescein isothiocyanate, rhodamine. 
dichlorotriazinylamine fluorescein, dansyl chloride or 
10 phycoerythrin; an example of a luminescent material 
includes luminol; examples of bioluminescent materials 
include luciferase, luciferin, and aequorin, and examples 
of suitable radioactive material include "'I, "S or 

15 TTI. Recnmhinant P x preasion Vectors and Host Cells 
Another aspect of the invention pertains to 
vectors, preferably expression vectors, containing a 
nucleic acid encoding T139 (or a portion thereof) . As 
used herein, the term "vector- refers to a nucleic acid 
20 molecule capable of transporting another nucleic acid to 
which it has been linked. One type of vector is a 
"plasmid", which refers to a circular double stranded DNA 
loop into which additional DNA segments can be ligated. 
Another type of vector is a viral vector, wherein 
25 additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors) . Other 
30 vectors (e.g., non- episomal mammalian vectors) are 
integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the 
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expression of genes to which they are operatively linked. 
In general, expression vectors of utility in recombinant 
DMA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
5 such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses) , which serve 
equivalent functions. 

The recombinant expression vectors of the 
10 invention comprise a nucleic acid of the invention in a 
form suitable for expression of the nucleic acid in a 
host cell, which means that the recombinant expression 
vectors include one or more regulatory sequences, 
selected on the basis of the host cells to be used for 
15 expression, which is operatively linked to the nucleic 

acid sequence to be expressed. Within a recombinant 
. expression vector, "operably linked" is intended to mean 
that the nucleotide sequence of interest is linked to the 
regulatory sequence (s) in a manner which allows for 
20 expression of the nucleotide sequence (e.g., in an in 
vitro transcription/translation system or in a host cell 
when the vector is introduced into the host cell) . The 
term "regulatory sequence" is intended to include 
promoters, enhancers and other expression control 
25 elements (e.g., polyadenylation signals) . Such 

regulatory sequences are. described, for example, in 
Goeddel; Gene Expression Technology: Methods in 
Enzymology 185. Academic Press, San Diego. CA (1990). 
Regulatory sequences include those which direct 
30 constitutive expression of a nucleotide sequence in many 
types of host cell and those which direct expression of 
the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences) . It will be 
appreciated by those skilled in the art that the design 
35 of the expression vector can depend on such factors as 
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the choice of the host cell to be transformed, the level 
of expression of protein desired, etc. The expression 
vectors of the invention can be introduced into host 
cells to thereby produce proteins or peptides, including 
5 fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., T139 proteins, mutant forms of 
T139, fusion proteins, etc). 

The recombinant expression vectors of the 
invention can be designed for expression of T13 9 in 
10 prokaryotic or eukaryotic cells, e.g., bacterial cells 
such as E. coli, insect cells (using baculovirus 
expression vectors) yeast cells or mammalian cells. 
Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Met±ods in Enzymology 185, 
15 Academic Press, San Diego, CA (1990). Alternatively, the 
recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter 
regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most 
20 often carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 
expression of either fusion or non-fusion proteins. 
Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the 
25 recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 
protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity 
30 purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 
of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion 
35 protein. Such enzymes, and their cognate recognition 
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sequences, include Factor Xa, thrombin and enterokinase . 
Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 
pMAL (New England Biolabs, Beverly, MA) and pRIT5 
5 (Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transf erase (GST) , maltose E binding protein, or protein 
A, respectively, to the target recombinant protein. 

Examples of suitable inducible non- fusion E. coli 
expression vectors include pTrc (Amann et al., (1988) 
10 Gene 69:301-315) and pET lid (Studier et al., Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 
host RNA polymerase transcription from a hybrid trp-lac 
15 fusion promoter. Target gene expression from the pET lid 
vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 
strains BL21(DE3) or HMS174(DE3) from a resident X 
20 prophage harboring a T7 gnl gene under the 

transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein 
expression in E. coli is to express the protein in a host 
bacteria with an impaired capacity to proteolytically 
25 cleave the recombinant protein (Gottesman, Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 119-128). 
Another strategy is to alter the nucleic acid sequence of 
the nucleic acid to be inserted into an expression vector 
30 so that the individual codons for each amino acid are 
those preferentially utilized in E. coli (Wada et al . 
(1992) Wucleic Acids J?es. 20:2111-2118). Such alteration 
of nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 
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In another embodiment, the T139 expression vector 
is a yeast expression vector. Examples of vectors for 
expression in yeast S. ceriviaae include pYepSecl 
(Baldari et al. (1987) EMBO J. 6:229-234), pMFa (Kurjan 
5 and Herskowitz, (1982) Cell 30:933-943), pJRYBS (Schultz 
et al. (1987) Gfene 54:113-123), pYES2 (Invitrogen 
Corporation, San Diego, CA) , and picZ (InVitrogen Corp, 

San Diego, CA) . 

Alternatively, T139 can be expressed in insect 

10 cells using baculovirus expression vectors. Baculovirus 
vectors available for expression of proteins in cultured 
insect cells (e.g., Sf 9 cells) include the pAc series 
(Smith et al. (1983) Wol. Cell Biol. 3:2156-2165) and the 
pVL series (Lucklow and Summers (1989) Virology 170:31- 

15 39) . 

In yet another embodiment, a nucleic acid of the 
invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian 
expression vectors include pCDMS (Seed (1987) Nature 
20 329:840) and pMT2PC (Kaufman et al. (1987) EMBO J . 6:187- 
195) . When used in mammalian cells, the expression 
vector's control functions are often provided by viral 
regulatory elements. For example, commonly used 
promoters are derived from polyoma. Adenovirus 2, 
25 cytomegalovirus and Simian Virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic 
cells see chapters 16 and 17 of Sambrook et al. (supra). 

In another embodiment, the recombinant mammalian 
expression vector is capable of directing expression of 
3 0 the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
express the nucleic acid) . Tissue- specif ic regulatory 
elements are known in the art. Non- limiting examples of 
suitable tissue-specific promoters include the albumin 
35 promoter (liver-specific; Pinkert et al. (1987) Genes 
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Dev. 1:268-277), lymphoid- specif ic promoters (Calame and 
Eaton (1988) Adv. Immunol. 43:235-275). in particular 
promoters of T cell receptors (Winoto and Baltimore 
(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et 
5 al. (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron- specif ic promoters (e.g., the 
neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl. Acad. Sci. USA 86:5473-5477), pancreas- specif ic 
promoters (Edlundetal. (1985) Science 230 :912-916) , and 
10 mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European 
Application Publication No. 264,166). Developmentally- 
regulated promoters are also encompassed, for example the 
murine hox promoters (Kessel and Gruss (1990) Science 
15 249:374-379) and the a-f etoprotein promoter (Campes and 
Tilghraan (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant 
expression vector comprising a DNA molecule of the 
invention cloned into the expression vector in an 
20 antisense orientation. That is, the DNA molecule is 
operatively linked to a regulatory sequence in a manner 
which allows for expression (by transcription of the DNA 
molecule) of an RNA molecule which is antisense to T139 
mRNA. Regulatory sequences operatively linked to a 
25 nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the 
antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue 
30 specific or cell type specific expression of antisense 
RNA. The antisense expression vector can be in the form 
of a recombinant plasmid, phagemid or attenuated virus in 
which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the 
35 activity of which can be determined by the cell type into 
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Which the vector is introduced. For a discussion of the 
regulation of gene expression using antisense genes See 
Weintraub et al.. Reviews - Trends in Genetics. Vol. 1(1) 

5 Another aspect of the invention pertains to host 

cells into which a recon^inant expression vector of the 
invention has been introduced. The terms "host cell- and 
-recombinant host cell" are used interchangeably herexn. 
It is understood that such terms refer not only to the 
10 particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications 
ntay occur in succeeding generations due to either 
nvutation or environmental influences, such progeny may 
not. in fact, be identical to the parent cell, but are 

15 still included within the scope of the term as used 

herein. , *_ • ^ 

A host cell can be any prokaryotic or eukaryotic 

cell For example, T139 protein can be expressed in 
bacterial cells such as E. ccli, insect cells, yeast or 
20 mammalian cells (such as Chinese hamster ovary -l^^ 

(CHO) or COS cells) . Other suitable host cells are known 
to those skilled in the art. 

vector DNA can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 
25 transfection techniques. As used herein, the terms 

"transformation" and "transfection" are intended to refer 
to a variety of art -recognized techniques for introducing 
foreign nucleic acid (e.g.. DNA) into a host cell, 
including calcium phosphate or calcium chloride co- 
30 precipitation, DEAE-dextran-mediated transfection, 

lipofection, or electroporation. Suitable methods for 
transforming or transfecting host cells can be found in 
Sambrook, et al. (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is 
35 known that, depending upon the expression vector and 
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transfection technique used, only a small fraction of 
cells may integrate the foreign DNA into their genome, 
in order to identify and select these integrants, a gene 
that encodes a selectable marker (e.g., resistance to 
5 antibiotics) is generally introduced into the host cells 
along with the gene of interest . Preferred selectable 
markers include those which confer resistance to drugs, 
such as G418, hygromycin and methotrexate. Nucleic acid 
encoding a selectable marker can be introduced into a 
10 host cell on the same vector as that encoding T139 or can 
be introduced on a separate vector. Cells ^stably 
transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, 
15 while the other cells die) . 

A host cell of the invention, such as a 
prokaryotic or eukaryotic host cell in culture, can be 
used to produce (i.e., express) T139 protein. 
Accordingly, the invention further provides methods for 
20 producing T139 protein using the host cells of the 
invention. In one embodiment, the method comprises 
culturing the host cell of invention (into which a 
recombinant expression vector encoding T139 has been 
introduced) in a suitable medium such that T13 9 protein 
25 is produced. In another embodiment, the method further 
comprises isolating T139 from the medium or the host 
cell . 

The host cells of the invention can also be used 
to produce nonhuman transgenic animals. For example, in 

30 one embodiment, a host cell of the invention is a 

fertilized oocyte or an embryonic stem cell into which 
T139-coding sequences have been introduced. Such host 
cells can then be used to create non-human transgenic 
animals in which exogenous T139 sequences have been 

35 introduced into their genome or homologous recombinant 
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animals in which endogenous T139 sequences have been 
altered. Such animals are useful for studying the . 
function and/or activity of T139 and for identifying 
and/or evaluating modulators of T139 activity. As used 
5 herein, a "transgenic animal" is a non-human animal, 
preferably a mammal, more preferably a rodent such as a 
rat or mouse, in which one or more of the cells of the 
animal includes a transgene. Other examples of 
transgenic animals include non-human primates, sheep, 
10 dogs, cows, goats, chickens, amphibians, etc. A 

transgene is exogenous DNA which is integrated into the 
genome of a cell from which a transgenic animal develops 
and which remains in the genome of the mature animal, 
thereby directing the expression of an encoded gene 
15 product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous 
recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous 
T139 gene has been altered by homologous recombination 
20 between the endogenous gene and an exogenous DNA molecule 
introduced into a cell of the animal, e.g., an embryonic 
cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be 
created by introducing T13 9 -encoding nucleic acid into 
25 the male pronuclei of a fertilized oocyte, e.g., by 
microinjection, retroviral infection, and allowing the 
oocyte to develop in a pseudopregnant female foster 
animal. The T139 cDNA sequence e.g., that of (SEQ ID 
NO-.l, SEQ ID N0:3, or the cDNA of ATCC 98694) can be 
30 introduced as a transgene into the genome of a non-human 
animal. Alternatively, a nonhuman homologue of the human 
T139 gene, such as a mouse T139 gene, can be isolated 
based on hybridization to the human T139 cDNA and used as 
a transgene. Intronic sequences and polyadenylation 
35 signals can also be included in the transgene to increase 
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the efficiency of expression of the transgene. A tissue - 
specific regulatory sequence (s) can be operably linked to 
the T139 transgene to direct expression of T139 protein 
to particular cells. Methods for generating transgenic 
5 animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become 
conventional in the art and are described, for example, ^ 
in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. Patent 
No. 4,873,191 and in Hogan, Manipulating the Mouse 
10 Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., 1986). Similar methods are used for 
production of other transgenic animals. A transgenic 
founder animal can be identified based upon the presence 
of the T139 transgene in its genome and/ or expression of 
15 T139 mRNA in tissues or cells of the animals. A 
transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene encoding T139 can 
further be bred to other transgenic animals carrying 
20 other transgenes. 

To create an homologous recombinant animal, a 
vector is prepared which contains at least a portion of a 
T13 9 gene (e.g., a human or a non-human homolog of the 
T139 gene, e.g., a murine T139 gene) into which a 
25 deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the T139 gene. 
In a preferred embodiment, the vector is designed such 
that, upon homologous recombination, the endogenous T139 
gene is functionally disrupted (i.e., no longer encodes a 
30 functional protein; also referred to as a "knock out" 

vector) . Alternatively, the vector can be designed such 
that, upon homologous recombination, the endogenous T139 
gene is mutated or otherwise altered but still encodes 
functional protein (e.g., the upstream regulatory region 
35 can be altered to thereby alter the expression of the 
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endogenous T139 protein) -In the homologous 
recombination vector, the altered portion of the T139 
gene is flanked at its 5' and 3' ends by additional 
nucleic acid of the T13 9 gene to allow for homologous 
5 recombination to occur between the exogenous T139 gene 
carried by the vector and an endogenous T139 gene in an 
embryonic stem cell. The additional flanking T139 
nucleic acid is of sufficient length for successful 
homologous recombination with the endogenous gene. 

10 Typically, several kilobases of flanking DNA (both at the 
5' and 3' ends), are included in the vector (see e.g., 
Thomas and Capecchi (1987) Cell 51:503 for a description 
of homologous recombination vectors) . The vector is 
introduced into an embryonic stem cell line (e*g., by 

15 elect roporat ion) and cells in which the introduced T139 
gene has homologously recombined with the endogenous T139 
gene are selected (see e.g., Li et al . (1992) Cell 
69:915) . The selected cells are then injected into a 
blastocyst of an animal (e.g., a mouse) to form 

20 aggregation chimeras (see, e.g., Bradley in 

Teratocarcinomas and Embiryonic Stem Cells: A Practical 
Approach, Robertson, ed. (IRL, Oxford, 1987) pp. 113- 
152) . A chimeric embryo can then be implanted into a 
suitable pseudopregnant female foster animal and the 

25 embryo brought to term. Progeny harboring the 

homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal 
contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing 

3 0 homologous recombination vectors and homologous 

recombinant animals are described further in Bradley 
(1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos . WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 
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In another embodiment, transgenic non-humans 
animals can be produced which contain selected systems 
which allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase 
5 system of bacteriophage PI. For a description of the 
cre/loxP recombinase system, see, e.g., Lakso et al. 
(1992) Proc. Natl, Acad. Sci. USA 89:6232-6236. Another 
example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (O' Gorman et al. 

10 (1991) Science 251:1351-1355. If a cre/loxP recombinase 
system is used to regulate expression of the transgene, 
animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such 
animals can be provided through the construction of 

15 "double" transgenic animals, e.g., by mating two 

transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene 
encoding a recombinase. 

Clones of the non-human transgenic animals 

20 described herein can also be produced according to the 

methods described in Wilmut et al . (1997) Nature 385:810- 
813 and PCT Publication Nos. WO 97/07668 and WO 97/07669. 
In brief, a cell, e.g., a somatic cell, from the 
transgenic animal can be isolated and induced to exit the 

25 growth cycle and enter phase. The quiescent cell can 
then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the 
same species from which the quiescent cell is isolated. 
The reconstructed oocyte is then cultured such that it 

30 develops to morula or blastocyte and then transferred to 
pseudopregnant female foster animal. The offspring borne 
of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is 
isolated. 
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TV. Pharmaceutical Compositions 

The T139 nucleic acid molecules, T139 proteins, 
and anti-T139 antibodies (also referred to herein as 
"active compounds") of the invention can be incorporated 
5 into pharmaceutical compositions suitable for 

administration. Such compositions typically comprise the 
nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the 
language "pharmaceutically acceptable carrier" is 
10 intended to include any and all solvents, dispersion 
media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use 
of such media and agents for pharmaceutically active 
15 substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the 
active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 
20 A pharmaceutical composition of the invention is 

formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
include parenteral , e.g., intravenous , intradermal , 
subcutaneous, oral (e.g., inhalation), transdermal 
25 (topical), transmucosal , and rectal administration. 
Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene 
30 glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid; buffers such as 
35 acetates, citrates or phosphates and agents for the 
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adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable 
5 syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for 
injectable use include sterile aqueous solutions (where 
water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable 

10 solutions or. dispersion. For intravenous administration, 
suitable carriers include physiological saline, 
bacteriostatic water, Cremophor EL™ (BASF; Parsippany, 
NJ) or phosphate buffered saline (PBS) . In all cases, 
the composition must be sterile and should be fluid to 

15 the extent that easy syringability exists. It must be 
stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier 
can be a solvent or dispersion medium containing, for 

20 example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyetheylene glycol, and 
the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the 

25 required particle size in the case of dispersion and by 
the use of surfactants. Prevention of the action of 
microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, 
chlorobutanol , phenol, ascorbic acid, thimerosal, and the 

30 like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such 
as raannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the 
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composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 
incorporating the active compound (e.g., a T139 protein 
5 or anti-T139 antibody) in the required amount in an 
appropriate solvent with one or a combination of 
ingredients enumerated above, as required, followed by 
filtered sterilization. Generally, dispersions are 
prepared by incorporating the active compound into a 

10 sterile vehicle which contains a basic dispersion medium 
and the required other ingredients from those enumerated 
above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and 

15 freeze-drying which yields a powder of the active 

ingredient plus any additional desired ingredient from a 
previously sterile-filtered solution thereof. 

Oral compositions generally include an inert 
diluent or an edible carrier. They can be enclosed in 

2 0 gelatin capsules or compressed into tablets. For the 

purpose of oral therapeutic administration, the active 
compound can be incorporated with excipients and used in 
the form of tablets, troches, or capsules. Oral 
compositions can also be prepared using a fluid carrier 
25 for use as a mouthwash, wherein the compound in the fluid 
carrier is applied orally and swished and expectorated or 
swallowed. Pharmaceutically compatible binding agents, 
and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and 

3 0 the like can contain any of the following ingredients, or 

compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 
agent such as alginic acid, Primogel, or corn starch; a 
35 lubricant such as magnesium stearate or Sterotes; a 
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glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
flavoring. For administration by inhalation, the 

5 compounds are delivered in the form of an aerosol spray 
from pressured container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebulizer. 

Systemic administration can also be by 
10 transmucosal or transdermal means. For transmucosal or 
transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art, and 
include, for example, for transmucosal administration, 
15 detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For 
transdermal administration, the active compounds are 
formulated into ointments, salves, gels, or creams as 
20 generally known in the art. 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 
25 In one embodiment, the active compounds are 

prepared with carriers that will protect the compound 
against rapid elimination from the body, such as a 
controlled release formulation, including implants and 
microencapsulated delivery systems. Biodegradable, 
30 biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides , polyglycolic acid, 
collagen, polyorthoesters, and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
those skilled in the art. The materials can also be 
35 obtained commercially from Alza Corporation and Nova 
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10 



Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
pharmaceutically acceptable carriers. These can be 
prepared according to methods known to those skilled in 
the art, for example, as described in U.S. Patent No. 
4,522,811. 

It is especially advantageous to formulate oral or 
parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each xinit containing a predetermined quantity of active 
compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
the invention are dictated by and directly dependent on 
the unique characteristics of the active compoiind and the 
particular therapeutic effect to be achieved, and the 
limitations inherent in the art of con^jounding such an 
active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors. 
Gene therapy vectors can be delivered to a subject by, 
for example, intravenous injection, local administration 
(see U.S. Patent 5,328,470) or by stereotactic injection 
(see e.g., Chen et al . (1994) Proc. Natl. Acad. Sci. USA 
91:3054-3057). The pharmaceutical preparation of the 
gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. 
retroviral vectors, the pharmaceutical preparation can 
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include one or more cells which produce the gene delivery 
system. 

The pharmaceutical conqpositions can be included in 
a container, pack, or dispenser together with 
5 instructions for administration. 

V TTfiRS and Methods of the In vention 

The nucleic acid molecules, proteins, protein 
homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 
10 assays; bh detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology), c) predictive medicine 
(e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic) . A T139 
15 protein interacts with other cellular proteins and can 
thus be used for (i) regulation of cellular 
proliferation; (ii) regulation of cellular 
differentiation; and (iii) regulation of cell survival. 
The isolated nucleic acid molecules of the invention can 
20 be used to express T139 protein (e.g., via a recombinant 
expression vector in a host cell in gene therapy 
applications), to detect T139 mRNA (e.g.. in a biological 
sattple) or a genetic lesion in a T139 gene, and to 
modulate T139 activity. In addition, the T139 proteins 
25 can be used to screen drugs or compounds which modulate 
the T139 activity or expression as well as to treat 
disorders characterized by insufficient or excessive 
production of T139 protein or production of T139 protein 
forms which have decreased or aberrant activity compared 
30 to T13 9 wild type protein. In addition, the anti-T139 
antibodies of the invention can be used to detect and 
isolate T139 proteins and modulate T139 activity. 
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This invention further pertains to novel agents 
identified by the above-described screening assays and 
uses thereof for treatments as described herein. 

A. Screening Assays 
5 The invention provides a method (also referred to 

herein as a "screening assay") for identifying 
modulators, i.e., candidate or test compounds or agents 
(e.g., peptides, peptidomimetics, small molecules or 
other drugs) which bind to T139 proteins or have a 
10 stimulatory or inhibitory effect on, for example, T139 
expression or T139 activity. 

In one embodiment, the invention provides assays 
for screening candidate or test compounds which bind to 
or modulate the activity of a T139 polypeptide or 
15 biologically active portion thereof. The test compounds 
of the present invention can be obtained using any of the 
numerous approaches in combinatorial library methods 
known in the art, including: biological libraries; 
spatially addressable parallel solid phase or solution 
20 phase libraries; synthetic library methods requiring 
deconvolution; the 'one-bead one-compound' library 
method; and synthetic library methods using affinity 
chromatography selection. The biological library 
approach is limited to peptide libraries, while the other 
25 four approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam 
(1997) Anticancer Drug Des. 12:145) . 

Examples of methods for the synthesis of molecular 
libraries can be found in the art, for example in: 
30 DeWitt et al. (1993) Proc. Natl. Acad. Sci, U.S.A. 

90:6909; Erb et al . (1994) Proc. Natl. Acad. Sci . USA 
91:11422; Zuckermann et al . (1994). J". Med. Chem. 
37:2678; Cho et al . (1993) Science 261:1303; Carrell et 
al. (1994) Angew. Chem. Int. Ed. Engl. 33:2059; Carell et 
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al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and 
Gallop et al. (1994) J. Med. Chem. 37:1233. 

Libraries of compoiinds may be presented in 
solution (e.g., Houghten (1992) Bio/Techniques 13:412- 
5 421), or on beads (Lam (1991) Nature 354 : 82-84) , chips 
(Fodor (1993) Nature 364:555-555), bacteria (U.S. Patent 
No. 5,223,409), spores (Patent Nos. 5,571,698; 5,403,484; 
and 5,223,409), plasmids (Cull et al. (1992) Proc. Natl. 
Acad. Sci. USA 89:1865-1869) or on phage (Scott and Smith 
10 (1990) Science 249:386-390; Devlin (1990) Science 

249:404-406; Cwirla et al . (1990) Proc. Natl. Acad. Sci. 
87:6378-6382; and Felici (1991) J. Mol. Biol. 222:301- 
310) . 

In one embodiment, an assay is a cell-based assay 

15 in which a cell which expresses T139 protein, or a 

biologically active portion thereof, is contacted with a 
test compound and the ability of the test compound to 
bind to a T139 protein determined. The cell, for 
example, can be a yeast cell or a cell of mammalian 

20 origin. Determining the ability of the test compound to 
bind to the T139 protein can be accomplished, for 
example, by coupling the test coitpovmd with a 
radioisotope or enzymatic label such that binding of the 
test compound to the T139 protein or biologically active 

25 portion thereof can be determined by detecting the 
labeled compound in a complex. For exanple, test 
compounds can be labeled with ^^^I, "S, "C, or ^H, either 
directly or indirectly, and the radioisotope detected by 
direct counting of radioemmission or by scintillation 

30 counting. Alternatively, test compounds can be 

enzymatically labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or lucif erase, and the 
enzymatic label detected by determination of conversion 
of an appropriate substrate to product. In a preferred 

35 embodiment, the assay comprises contacting a cell which 
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expresses a T139 protein, or a biologically active 
portion thereof, with a knovm compound which binds T139 
to form an assay mixture, contacting the assay mixture 
with a test compound, and determining the ability of the 
5 test compound to interact with a T139 protein, wherein 
determining the ability of the test compound to interact 
with a T139 protein comprises determining the ability of 
the test compound to preferentially bind to T139 or a 
biologically active portion thereof as compared to the 

10 known compound. 

In another embodiment, an assay is a cell-based 
assay comprising contacting a cell expressing a T13 9 
protein, or a biologically active portion thereof, with a 
test compound and determining the ability of the test 

15 compound to modulate (e.g., stimulate or inhibit) the 
activity of the T139 protein or biologically active 
portion thereof. Determining the ability of the test 
compound to modulate the activity of T13 9 or a 
biologically active portion thereof can be accomplished, 

20 for example, by deteormining the ability of the T139 
protein to bind to or interact with a T139 target 
molecule. As used herein, a "target molecule" is a 
molecule with which a T139 protein binds or interacts in 
nature, for example, a T13 9 receptor, a molecule in the 

25 extracellular milieu, or a molecule associated with the 
external surface of a cell membrane. A T139 target 
molecule can be a non-T139 molecule or a T139 protein or 
polypeptide of the present invention. In one embodiment, 
a T13 9 target molecule is a component of a signal 

30 transduction pathway which facilitates transduction of an 
extracellular signal (e.g., a signal generated by binding 
of a T139 molecule to its receptor) through the cell 
membrane and into the cell. 

Determining the ability of the T13 9 protein to 
35 bind to or interact with a T139 target molecule can be 
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accomplished by one of the methods described above for 
detennining direct binding. In a preferred embodiment, 
determining the ability of the T139 protein to bind to or 
interact with a T139 target molecule can be accomplished 
5 by determining the activity of the target molecule. For 
example, the activity of the target molecule can be 
determined by detecting induction of a cellular second 
messenger of the target (e.g., intracellular Ca^*, 
diacylglycerol, IP3, etc.), detecting catalytic/enzymatic 
10 activity of the target an appropriate substrate, 

detecting the induction of a reporter gene (e.g., a T139- 
responsive regulatory element operatively linked to a 
nucleic acid encoding a detectable marker, e.g. 
luciferase), or detecting a cellular response, for 
15 example, cell survival, cellular differentiation, or cell 
prol if erat ion . 

In yet another embodiment, an assay of the present 
invention is a cell-free assay comprising contacting a 
T139 protein or biologically active portion thereof with 
20 a test compound and determining the ability of the test 
compound to bind to the T139 protein or biologically 
active portion thereof. Binding of the test compound to 
the T139 protein can be determined either directly or 
indirectly as described above. In a preferred 
25 embodiment, the assay includes contacting the T139 
protein or biologically active portion thereof with a 
known compound which binds T139 to form an assay mixture, 
contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
30 with a T139 protein, wherein determining the ability of 
the test compound to interact with a T139 protein 
comprises determining the ability of the test compound to 
preferentially bind to T139 or biologically active 
portion thereof as compared to the known compound. 
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in another embodiment, an assay is a cell-free 
assay comprising contacting T139 protein or biologically 
active portion thereof with a test compound and 
determining the ability of the test compound to modulate 
5 (e.g., stimulate or inhibit) the activity of the T139 
protein or biologically active portion thereof. 
Determining the ability of the test compound to modulate 
the activity of T139 can be accomplished, for example, by 
determining the ability of the T139 protein to bind to a 
10 T139 target molecule by one of the methods described 

above for determining direct binding. In an alternative 
embodiment, determining the ability of the test compound 
to modulate the activity of T139 can be accomplished by 
determining the ability of the T139 protein further 
15 modulate a T139 target molecule. For example, the 

catalytic/enzymatic activity of the target molecule on an 
appropriate substrate can be determined as previously 
described. 

In yet another embodiment, the cell -free assay 
20 comprises contacting the T139 protein or biologically 
active portion thereof with a known compound which binds 
T139 to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability 
of the test compound to interact with a T139 protein. 
25 wherein determining the ability of the test compound to 
interact with a T139 protein comprises determining the 
ability of the T139 protein to preferentially bind to or 
modulate the activity of a T139 target molecule. 

in some cell -based assays, it may be desirable to 
30 utilize a solubilizing agent such that a membrane -bound 
form of T139 is- maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as 
n-octylglucoside, n-dodecylglucoside. n-dodecylmaltoside, 
octanoyl-N-methylglucamide. decanoyl-N-methylglucamide. 
35 Triton® X-100, Triton® X-114, Thesit®. 
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Isotridecypoly (ethylene glycol ether)n, 3-[{3- 
cholamidopropyDdimethylamminio] -1-propane sulfonate 
(CHAPS) , 3-1 (3-cholamidopropyl) dimethylamminio] -2- 
hydroxy-l-propane sulfonate (CHAPSO) , or N-dodecyl=N,N- 
dimethyl-3-atnmonio-l-propane sulfonate . 

In more than one embodiment of the above assay 
methods of the present invention, it may be desirable to 
immobilize either T139 or its target molecule to 
facilitate separation of complexed from uncomplexed forms 
of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to 
T139. or interaction of T139 with a target molecule in 
the presence and absence of a candidate compound, can be 
accomplished in any vessel suitable for containing the 
15 reactants. Examples of such vessels include microtitre 
plates, test tubes, and micro- centrifuge tubes. In one 
embodiment, a fusion protein can be provided which adds a 
domain that allows one or both of the proteins to be 
bound to a matrix. For example, glutathione- S- 
20 transferase/T139 fusion proteins or glutathione-S- 

transferase/target fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical; St. Louis, 
MO) or glutathione derivatized microtitre plates, which 
are then combined with the test compound or the test 
25 compound and either the non-adsorbed target protein or 
T139 protein, and the mixture incubated under conditions 
conducive to complex formation (e.g., at physiological 
conditions for salt and pH) . Following incubation, the 
beads or microtitre plate wells are washed to remove any 
30 unbound components, the matrix immobilized in the case of 
beads, complex determined either directly or indirectly, 
for example, as described above. Alternatively, the 
complexes can be dissociated from the matrix, and the 
level of T139 binding or activity determined using 
35 standard techniques. 
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Other techniques for immobilizing proteins on 
matrices can also be used in the screening assays of the 
invention. For example, either T139 or its target 
molecule can be immobilized utilizing conjugation of 
5 biotin and streptavidin. Biotinylated T139 or target 
tnolecules can be prepared from biotin-NHS (N-hydroxy- 
Buccinimide) using techniques well known in the art 
(e.g., biotinylation kit. Pierce Chemicals; Rockford, 
ID , Ind immobilized in the wells of streptavidin- coated 
10 96 well plates (Pierce Chemical) . Alternatively, 

antibodies reactive with T139 or target molecules but 
which do not interfere with binding of the T139 protein 
to its target molecule can be derivatized to the wells of 
the plate, and unbound target or T139 trapped in the 
15 wells by antibody conjugation. Methods for detecting 

such complexes, in addition to those described above for 
the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the T139 or 
target molecule, as well as enzyme-linked assays which 
20 rely on detecting an enzymatic activity associated with 
the T139 or target molecule. 

In another embodiment, modulators of T139 
expression are identified in a method in which a cell is 
contacted with a candidate compound and the expression of 
25 T139 mRNA or protein in the cell is determined. The 
level of expression of T139 mRNA or protein in the 
presence of the candidate compound is compared to the 
level of expression of T139 mRNA or protein in the 
absence of the candidate compound. The candidate 
30 compound can then be identified as a modulator of T139 
expression based on this comparison. For example, when 
expression of T139 mRNA or protein is greater 
(statistically significantly greater) in the presence of 
the candidate compound than in its absence, the candidate 
35 compound is identified as a stimulator of T139 mRNA or 
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protein expression. Alternatively, when expression of 
T139 mRNA or protein is less (statistically significantly 
less) in the presence of the candidate compound than in 
its absence, the candidate compovind is identified as an 
5 inhibitor of T139 mRNA or protein expression. The level 
of T139 mRNA or protein expression in the cells can be 
determined by methods described herein for detecting T139 

tnRNA or protein. 

In yet another aspect of the invention, the T139 
10 proteins can be used as "bait proteins" in a two-hybrid 
assay or three hybrid assay (see, e.g., U.S. Patent No. 
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura 
et al'. (1993) J- Biol. Chem. 268:12046-12054; Bartel et 
al. (1993) Bio/Techniquea 14:920-924; Iwabuchi et al. 
15 (1993) oncogene 8:1693-1696; and WO 94/10300) , to 

identify other proteins, which bind to or interact with 
T139 ("T139-binding proteins" or "T139-bp") and modulate 
T139 activity. Such T139-binding proteins are also 
likely to be involved in the propagation of signals by 
20 the T139 proteins as, for example, upstream or downstream 
elements of the T139 pathway. 

The two-hybrid system is based on the modular 
nature of most transcription factors, which consist of 
separable DNA-binding and activation domains. Briefly, 
25 the assay utilizes two different DNA constructs. In one 
construct, the gene that codes for T139 is fused to a 
gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4) . In the other 
construct, a DNA sequence, from a library of DNA 
30 sequences, that encodes an unidentified protein ("prey" 
or "sample") is fused to a gene that codes for the 
activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, 
in vivo, forming an T13 9 -dependent complex, the DNA- 
35 binding and activation domains of the transcription 
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factor are brought into close proximity. This proximity 
allows transcription of a reporter gene (e.g.. LacZ) 
which is operably linked to a transcriptional regulatory 
site responsive to the transcription factor. Expression 
of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be 
isolated and used to obtain the cloned gene which encodes 
the protein which interacts with T139. 

This invention further pertains to novel agents 
identified by the above -described screening assays and 
uses thereof for treatments as described herein. 



B. Detectio n Assays 

Portions or fragments of the cDNA sequences 
identified herein (and the corresponding complete gene 

15 sequences) can be used in numerous ways as polynucleotide 
reagents. For example, these sequences can be used to: 
(i) map their respective genes on a chromosome; and, 
thus, locate gene regions associated with genetic 
disease; (ii) identify an individual from a minute 

20 biological sample (tissue typing) ; and (iii) aid xn 
forensic identification of a biological sample. These 
applications are described in the subsections below. 

1 fh-rnmoBOm ''^ Mapping 

Once the sequence (or a portion of the sequence) 
25 of a gene has been isolated, this sequence can be used to 
map the location of the gene on a chromosome. 
Accordingly, T139 nucleic acid molecules described herein 
or fragments thereof, can be used to map the location of 
T139 genes on a chromosome. The mapping of the T139 
30 sequences to chromosomes is an important first step in 
correlating these sequences with genes associated with 
disease. 
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Briefly, T139 genes can be mapped to chromosomes 
by preparing PGR primers (preferably 15-25 bp in length) 
from the T139 sequences. Computer analysis of T139 
sequences can be used to rapidly select primers that do 
5 not span more than one exon in the genomic DNA, thus 
complicating the amplification process. These primers 
can then be used for PGR screening of somatic cell 
hybrids containing individual human chromosomes. Only 
those hybrids containing the human gene corresponding to 
10 the T139 sequences will yield an amplified fragment. 

somatic cell hybrids are prepared by fusing 

somatic cells from different mammals (e.g., human and 
mouse cells) . As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in 
15 random order, but retain the mouse chromosomes. By using 
media in which mouse cells cannot grow, because they lack 
a particular enzyme, but human cells can, the one human 
chromosome that contains the gene encoding the needed 
enzyme, will be retained. By using various media, panels 
20 of hybrid cell lines can be established. Each cell line 
in a panel contains either a single human chromosome or a 
small number of human chromosomes, and a full set of 
mouse chromosomes, allowing easy mapping of individual 
genes to specific human chromosomes. (D'Eustachio et al . 
25 (1983) science 220:919-924). Somatic cell hybrids 

containing only fragments of human chromosomes can also 
be produced by using human chromosomes with 
translocations and deletions. 

PGR mapping of somatic cell hybrids is a rapid 
30 procedure for assigning a particular sequence to a 

particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using 
the T139 sequences to design oligonucleotide primers, 
sublocalization can be achieved with panels of fragments 
35 from specific chromosomes. Other mapping strategies 
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Which can similarly be used to map a T139 sequence to its 
chromosome include in situ hybridization (described in 
Fan et al. (1990) Proc. Natl. Acad. Sci. USA 87:6223-27), 
pre-screening with labeled flow-sorted chromosomes, and 
5 pre-selection by hybridization to chromosome specific 

cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA 
sequence to a metaphase chromosomal spread can further be 
used to provide a precise chromosomal location in one 
10 step. Chromosome spreads can be made using cells whose 
division has been blocked in metaphase by a chemical like 
colcemid that disrupts the mitotic spindle. The 
chromosomes can be treated briefly with trypsin, and then 
stained with Giemsa. A pattern of light and dark bands 
15 develops on each chromosome, so that the chromosomes can 
be identified individually. The FISH technique can be 
used with a DNA sequence as short as 500 or 600 bases. 
However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location 
20 with sufficient signal intensity for simple detection. 
Preferably 1,000 bases, and more preferably 2,000 bases 
will suffice to get good results at a reasonable amount 
of time. For a review of this technique, see Verma et 
al . , Human Chromosomes : A Manual of Basic Techniques 
25 (Pergamon Press, New York, 1988) . 

Reagents for chromosome mapping can be used 
individually to mark a single chromosome or a single site 
on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. 
30 Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene 
families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 
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Once a sequence has been mapped to a precise 
chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic 
map data. (Such data are found, for example, in V. 
5 McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 
same chromosomal region, can then be identified through 
linkage analysis (co-inheritance of physically adjacent 
10 genes), described in, e.g., Egeland et al. (1987) Mature, 
325:783-787. 

Moreover, differences in the DNA sequences between 
individuals affected and unaffected with a disease 
associated with the T139 gene can be determined. If a 

15 mutation is observed in some or all of the affected 

individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the 
particular disease. Comparison of affected and 
unaffected individuals generally involves first looking 

20 for structural alterations in the chromosomes such as 
deletions or translocations that are visible from 
chromosome spreads or detectable using PGR based on that 
DNA sequence. Ultimately, complete sequencing of genes 
from several individuals can be performed to confirm the 

25 presence of a mutation and to distinguish mutations from 
polymorphisms . 

9 TiflHue Typing 

The T139 sequences of the present invention can 
also be used to identify individuals from minute 
30 biological samples. The United States military, for 
example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its 
personnel. In this technique, an individual's genomic 
DNA is digested with one or more restriction enzymes, and 
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probed on a Southern blot to yield unique bands for 
identification. This method does not suffer from the 
current limitations of "Dog Tags" which can be lost, 
switched, or stolen, making positive identification 
5 difficult. The sequences of the present invention are 
useful as additional DNA markers for RFLP (described in 
U.S. Patent 5,272,057) . 

Furthermore, the sequences of the present 
invention can be used to provide an alternative technique 
10 which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 
T139 sequences described herein can be used to prepare 
two PGR primers from the 5' and 3' ends of the sequences. 
These primers can then be used to amplify an individual's 
15 DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from 
individuals, prepared in this manner, can provide unique 
individual identifications, as each individual will have 
a unique set of such DNA sequences due to allelic 
20 differences. The sequences of the present invention can 
be used to obtain such identification sequences from 
individuals and from tissue. The T139 sequences of the 
invention uniquely represent portions of the human 
genome. Allelic variation occurs to some degree in the 
25 coding regions of these sequences, and to a greater 
degree in the noncoding regions. It is estimated that 
allelic variation between individual humans occurs with a 
frequency of about once per each 500 bases. Each of the 
sequences described herein can, to some degree, be used 
30 as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater 
numbers of polymorphisms occur in the noncoding regions, 
fewer sequences are necessary to differentiate 
individuals. The noncoding sequences of SEQ ID N0:1 can 
35 comfortably provide positive individual identification 



wo 99/54343 



PCTAJS99/08896 



- 71 - 



With a panel of perhaps 10 to 1,000 primers which each 
yield a noncoding amplified sequence of 100 bases. If 
predicted coding sequences, such as those in SEQ ID NO: 3 
are used, a more appropriate number of primers for 
5 positive individual identification would be 500-2,000. 
If a panel of reagents from T139 sequences 
described herein is used to generate a unique 
identification database for an individual, those same 
reagents can later be used to identify tissue from that 
10 individual, using the unique identification database, 
positive identification of the individual, living or 
dead, can be made from extremely small tissue samples. 

TTHfi of Pa-rfial T i^Q SeouenreB in Forensjc 

Biology 

15 DNA-based identification techniques can also be 

used in forensic biology. Forensic biology is a 
scientific field ettploying genetic typing of biological 
evidence found at a crime scene as a means for positively 
identifying, for example, a perpetrator of a crime. To 
20 make such an identification, PGR technology can be used 
to amplify DNA sequences taken from very small biological 
samples such as tissues, e.g., hair or skin, or body 
fluids, e.g., blood, saliva, or semen found at a crime 
scene. The amplified sequence can then be compared to a 
25 standard, thereby allowing identification of the origin 
of the biological sample. 

The sequences of the present invention can be used 
to provide polynucleotide reagents, e.g.. PGR primers, 
targeted to specific loci in the human genome, which can 
30 enhance the reliability of DNA-based forensic 

identifications by, for example, providing another 
"identification marker" (i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 
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identification as an accurate alternative to patterns 
formed by restriction enzyme generated fragments. 
Sequences targeted to noncoding regions of SEQ ID N0:1 
are particularly appropriate for this use as greater 
5 numbers of polymorphisms occur in the noncoding regions, 
making it easier to differentiate individuals using this 
technique. Examples of polynucleotide reagents include 
the T139 sequences or portions thereof, e.g., fragments 
derived from the noncoding regions of SEQ ID N0:1 having 
10 a length of at least 20 or 30 bases. 

The T139 sequences described herein can further be 
used to provide polynucleotide reagents, e.g., labeled or 
labelable probes which can be used in, for example, an in 
situ hybridization technique, to identify a specific 
15 tissue, e.g., brain tissue. This can be very useful in 
cases where a forensic pathologist is presented with a 
tissue of unknown origin. Panels of such T139 probes can 
be used to identify tissue by species and/or by organ 
type . 

20 In a similar fashion, these reagents, e.g., T139 

primers or probes can be used to screen tissue culture 
for contamination (i.e., screen for the presence of a 
mixture of different types of cells in a culture) . 



C . Predictive Medicine 

25 The present invention also pertains to the field 

of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics, and monitoring 
clinical trails are used for prognostic (predictive) 
purposes to thereby treat an individual prophylactically. 

30 Accordingly, one aspect of the present invention relates 
to diagnostic assays for determining T139 protein and/ or 
nucleic acid expression as well as T139 activity, in the 
context of a biological sample (e.g., blood, serum, 
cells, tissue) to thereby determine whether an individual 



wo 99/54343 



PCT/US99/08896 



- 73 



is afflicted with a disease or disorder, or is at risk of 
developing a disorder, associated with aberrant T139 
expression or activity. The invention also provides for 
prognostic (or predictive) assays for determining whether 
5 an individual is at risk of developing a disorder 

associated with T139 protein, nucleic acid expression or 
activity. For example, mutations in a T139 gene can be 
assayed in a biological sample. Such assays can be used 
for prognostic or predictive purpose to thereby 
10 prophylactically treat an individual prior to the onset 
of a disorder characterized by or associated with T139 
protein, nucleic acid expression or activity. 

Another aspect of the invention provides methods 
for determining T139 protein, nucleic acid expression or 
15 T139 activity in an individual to thereby select 

appropriate therapeutic or prophylactic agents for that 
individual (referred to herein as "pharmacogenomics") . 
Pharmacogenomics allows for the selection of agents 
(e.g., drugs) for therapeutic or prophylactic treatment 
20 of an individual based on the genotype of the individual 
(e.g., the genotype of the individual examined to 
determine the ability of the individual to respond to a 

particular agent . ) 

Yet another aspect of the invention pertains to 
25 monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of T139 in 

clinical trials. 

These and other agents are described in further 

detail in the following sections. 



30 



1 . niaqnostic Assays 

An exemplary method for detecting the presence or 
absence of T139 in a biological sample involves obtaining 
a biological sample from a test subject and contacting 
the biological sample with a compound or an agent capable 
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Of detecting T139 protein or nucleic acid (e.g., mRNA, 
genomic DNA) that encodes T139 protein such that the 
presence of T139 is detected in the biological sample. A 
preferred agent for detecting T139 mRNA or genomic DNA is 
5 a labeled nucleic acid probe capable of hybridizing to 
T139 mRNA or genomic DNA. The nucleic acid probe can be, 
for example, a full-length T139 nucleic acid, such as the 
nucleic acid of SEQ ID NO: 1 or 3, or a portion thereof, 
such as an oligonucleotide of at least 15, 30, 50, 100, 
LO 250 or 500 nucleotides in length and sufficient to 

specifically hybridize under stringent conditions to T139 
tnRNA or genomic DNA. Other suitable probes for use m 
the diagnostic assays of the invention are described 
herein. 

15 A preferred agent for detecting T139 protein is an 

antibody capable of binding to T139 protein, preferably 
an antibody with a detectable label. Antibodies can be 
polyclonal, or more preferably, monoclonal. An intact 
antibody, or a fragment thereof (e.g., Fab or F(ab').) can 
20 be used. The term "labeled", with regard to the probe or 
antibody, is intended to encompass direct labeling of the 
probe or antibody by coupling (i.e., physically linking) 
a detectable substance to the probe or antibody, as well 
as indirect labeling of the pr6be or antibody by 
25 reactivity with another reagent that is directly labeled. 
Examples of indirect labeling include detection of a 
primary antibody using a f luorescently labeled secondary 
antibody and end-labeling of a DNA probe with biotin such 
that it can be detected with f luorescently labeled 
30 streptavidin. The term "biological sample- is intended 
to include tissues, cells and biological fluids isolated 
from a subject, as well as tissues, cells and fluids 
present within a subject. That is, the detection method 
of the invention can be used to detect T13 9 mRNA, 
35 protein, or genomic DNA in a biological sample in vitro 
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as well as in vivo. For example, in vitro techniques for 
detection of T139 mRNA include Northern hybridizations 
and in situ hybridizations. In vitro techniques for 
detection of T139 protein include enzyme linked 
5 immunosorbent assays (ELISAs) , Western blots, 

immunoprecipitations and immunofluorescence. In vitro 
techniques for detection of T139 genomic DNA include 
southern hybridizations. Furthermore, in vivo techniques 
for detection of T139 protein include introducing into a 
10 subject a labeled anti-T139 antibody. For example, the 
antibody can be labeled with a radioactive marker whose 
presence and location in a subject can be detected by 
standard imaging techniques. 

in one embodiment, the biological sample contains 
15 protein molecules from the test subject. Alternatively 
the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test 
subject A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means 

20 from a subject. 

in another embodiment, the methods further involve 

obtaining a control biological sample from a control 

subject, contacting the control sample with a compound or 

agent capable of detecting T139 protein, mRNA, or genomic 

25 DNA, such that the presence of T139 protein, mRNA or 

genomic DNA is detected in the biological sample, and 

comparing the presence of T139 protein, mRNA or genomic 

DNA in the control sample with the presence of T139 

protein, mRNA or genomic DNA in the test sample. 

30 The invention also encompasses kits for detecting 

the presence of T139 in a biological • sample (a test 

sample) . Such kits can be used to determine if a subject 

is suffering from or is at increased risk of developing a 

disorder associated with aberrant expression of T139. 

35 For example, the kit can comprise a labeled compound or 
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agent capable of detecting T139 protein or itiRNA in a 
biological sample and means for determining the amount of 
T139 in the sample (e.g., an anti-T139 antibody or an 
oligonucleotide probe which binds to DNA encoding T139, 

5 e g., SEQ ID N0:1 or SEQ ID N0:3) . Kits may also include 
insticuction for observing that the tested subject is 
suffering from or is at risk of developing a disorder 
associated with aberrant expression of T139 if the amount 
of T139 protein or mRNA is above or below a normal level. 

10 For antibody-based kits, the kit may comprise, for 

example: (1) a first antibody (e.g., attached to a solid 
support) which binds to T139 protein; and, optionally, 
(2) a second, different antibody which binds to T139 
protein or the first antibody and is conjugated to a 

15 detectable agent. 

For oligonucleotide-based kits, the kit may 
comprise, for example: (1) a oligonucleotide, e.g., a 
detectably labelled oligonucleotide, which hybridizes to 
a T139 nucleic acid sequence or (2) a pair of primers 

20 useful for amplifying a T139 nucleic acid molecule. 

The kit may also comprise, e.g., a buffering 
agent, a preservative, or a protein stabilizing agent. 
The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a 

25 substrate) . The kit may also contain a control sample or 
a series of control samples which can be assayed and 
compared to the test sample contained. Each component of 
the kit is usually enclosed within an individual 
container and all of the various containers are within a 

30 single package along with instructions for observing 
whether the -tested subject is suffering from or is at 
risk of developing a disorder associated with aberrant 
expression or activity of T139. 
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2 . Proanosf i c^ Assavs 

The methods described herein can furthermore be 
utilized as diagnostic or prognostic assays to identify 
subjects having or at risk of developing a disease or 
5 disorder associated with aberrant T13 9 expression or 

activity. For example, the assays described herein, such 
as the preceding diagnostic assays or the following 
assays, can be utilized to identify a subject having or 
at risk of developing a disorder associated with T139 

10 protein, nucleic acid expression or activity. 

Alternatively, the prognostic assays can be utilized to 
identify a subject having or at risk for developing such 
a disease or disorder. Thus, the present invention 
provides a method in which a test sample is obtained from 

15 a subject and T139 protein or nucleic acid (e.g., mRNA, 
genomic DNA) is detected, wherein the presence of T139 
protein or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder 
associated with aberrant T13 9 expression or activity. As 

20 used herein, a "test sample" refers to a biological 

sample obtained from a subject of interest. For example, 
a test sample can be a biological fluid (e.g., serum), 

cell sample, or tissue. 

Furthermore, the prognostic assays described 
25 herein can be used to determine whether a subject can be 
administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate) to treat a disease or 
disorder associated with aberrant T139 expression or 
30 activity. For example, such methods can be used to 

determine whether a subject can be effectively treated 
with a specific agent or class of agents (e.g.. agents of 
a type which decrease T13 9 activity) . Thus, the present 
invention provides methods for determining whether a 
35 subject can be effectively treated with an agent for a 
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disorder associated with aberrant T139 expression or 
activity in which a test sample is obtained and T139 
protein or nucleic acid is detected (e.g., wherein the 
presence of T139 protein or nucleic acid is diagnostic 
5 for a subject that can be administered the agent to treat 
a disorder associated with aberrant T139 expression or 
activity) . 

The methods of the invention can also be used to 
detect genetic lesions or mutations in a T139 gene, 
10 thereby determining if a subject with the lesioned gene 
is at risk for a disorder characterized by aberrant cell 
proliferation and/or differentiation. In preferred 
embodiments, the methods include detecting, in a sample 
of cells from the subject, the presence or absence of a 
15 genetic lesion characterized by at least one of an 

alteration affecting the integrity of a gene encoding a 
T139-protein, or the mis-expression of the T139 gene. 
For example, such genetic lesions can be detected by 
ascertaining the existence of at least one of 1) a 
20 deletion of one or more nucleotides from a T139 gene; 2) 
an addition of one or more nucleotides to a T139 gene; 3) 
a substitution of one or more nucleotides of a T139 gene, 
4) a chromosomal rearrangement of a T139 gene; 5) an 
alteration in the level of a messenger RNA transcript of 
25 a T139 gene, 6) aberrant modification of a T139 gene, 

such as of the methylation pattern of the genomic DNA, 7) 
the presence of a non-wild type splicing pattern of a 
messenger RNA transcript of a T139 gene, 8) a non-wild 
type level of a T139-protein, 9) allelic loss of a T139 
30 gene, and 10) inappropriate post-translational 

modification of a T139-protein. As described herein, 
there are a large number of assay techniques known in the 
art which can be used for detecting lesions in a T139 
gene. A preferred biological sample is a peripheral 
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blood leukocyte sample isolated by conventional means 

from a subject. 

in certain embodiments, detection of the lesion 
involves the use of a probe/primer in a polymerase chain 
5 reaction (PGR) (see, e.g., U.S. Patent Nos. 4.683.195 and 
4 683.202), such as anchor PGR or RAGE PGR. or, 
alternatively, in a ligation chain reaction (LGR) (see. 
e g , Landegran et al. (1988) Science 241:1077-1080; and 
Nakazawa et al. (1994) Proc. Natl. Acad. Sci. USA 91:360- 
10 364) , the latter of which can be particularly useful for 
detecting point mutations in the T139-gene (see Abravaya 
et al. (1995) Nucleic Acids J?es. 23:675-682) . This 
method can include the steps of collecting a sample of 
cells from a patient, isolating nucleic acid (e.g., 
15 genomic. mRNA or both) from the cells of the sample, 
contacting the nucleic acid sample with one or more 
primers which specifically hybridize to a T139 gene under 
conditions such that hybridization and amplification of 
the T139-gene (if present) occurs, and detecting the 
20 presence or absence of an amplification product, or 
detecting the size of the amplification product and 
comparing the length to a control sample. It is 
anticipated that PGR and/or LGR may be desirable to use 
as a preliminary amplification step in conjunction with 
25 any of the techniques used for detecting mutations 

described herein. 

Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al. (1990) 
Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 

30 amplification system (Kwoh, et al. (1989) Proc. Natl. 
Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al. (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques 

35 well known to those of skill in the art. These detection 
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schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present m 

very low numbers. 

In an alternative embodiment, mutations in a T139 
5 gene from a sample cell can be identified by alterations 
in restriction enzyme cleavage patterns. For example, 
sample and control DNA is isolated, amplified 
(optionally) , digested with one or more restriction 
endonucleases, and fragment length sizes are determined 
10 by gel electrophoresis and compared. Differences in 
fragment length sizes between sample and control DNA 
indicates mutations in the sample DNA. Moreover, the use 
of sequence specific ribozymes (see, e.g., U.S. Patent 
NO 5 498,531) can be used to score for the presence of 
15 specific mutations by development or loss of a ribozyme 

cleavage site. 

in other embodiments, genetic mutations in T139 
can be identified by hybridizing a sample and control 
nucleic acids, e.g., DNA or RNA, to high density arrays 
20 containing hundreds or thousands of oligonucleotides 
probes (Cronin et al. (1996) Human Mutation 7:244-255; 
Kozal et al. (1996) Nature Medicine 2:753-759). For 
example, genetic mutations in T139 can be identified xn 
two-dimensional arrays containing light -generated DNA 
25 probes as described in Cronin et al. supra. Briefly, a 
first hybridization array of probes can be used to scan 
through long stretches of DNA in a sample and control to 
identify base changes between the sequences by making 
linear arrays of sequential overlapping probes. This 
30 step allows the identification of point mutations. This 
step is followed by a second hybridization array that 
allows the characterization of specific mutations by 
using smaller, specialized probe arrays complementary to 
all variants or mutations detected. Each mutation array 
35 is composed of parallel probe sets, one complementary to 
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the wild-type gene and the other complementary to the 

mutant gene. 

In yet another embodiment, any of a variety of 
sequencing reactions knovm in the art can be used to 

5 directly sequence the T139 gene and detect mutations by 
comparing the sequence of the sample T139 with the 
corresponding wild- type (control) sequence. Examples of 
sequencing reactions include those based on techniques 
developed by Maxim and Gilbert ((1977) Proc. Natl. Acad. 
10 sci. USA 74:560) or Sanger {(1977) Proc. Natl. Acad. 

■sci. USA 74:5463) . It is also contemplated that any of a 
variety of automated sequencing procedures can be 
utilized when performing the diagnostic assays ((1995) 
Bio/Technigues 19:448), including sequencing by mass 

15 spectrometry (see., e.g., PCT Publication No. WO 94/16101; 
Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and 
Griffin et al. (1993) Appl. Biochem. Biotechnol. 38:147- 
159) . 

Other methods for detecting mutations in the T139 
20 gene include methods in which protection from cleavage 
agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al. (1985) Science 
230:1242). In general, the art technique of "mismatch 
cleavage" starts by providing heteroduplexes of formed by 
25 hybridizing (labeled) RNA or DNA containing the wild-type 
T139 sequence with potentially mutant RNA or DNA obtained 
from a tissue sample. The double -stranded duplexes are 
treated with an agent which cleaves single-stranded 
regions of the duplex such as which will exist due to 
30 basepair mismatches between the control and sanple 

strands. For instance, RNA/DNA duplexes can be treated 
with RNase and DNA/DNA hybrids treated with SI nuclease 
to enzymatically digesting the mismatched regions. In 
other embodiments, either DNA/DNA or RNA/DNA duplexes can 
35 be treated with hydroxylamine or osmium tetroxide and 
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With piperidine in order to digest mismatched regions. 
After digestion of the mismatched regions, the resulting 
material is then separated by size on denaturing 
polyacrylamide gels to determine the site of mutation. 
5 See, e.g., Cotton et al (1988) Proc. Natl Acad Sci USA 
85:4397; Saleeba et al (1992) Methods Enzymol. 217:286- 
295. In a preferred embodiment, the control DNA or RNA 
can be labeled for detection. 

In still another embodiment, the mismatch cleavage 
10 reaction employs one or more proteins that recognize 
mismatched base pairs in double -stranded DNA (so called 
«DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in T139 cDNAs 
obtained from samples of cells. For example, the mutY 
15 enzyme of E. coli cleaves A at G/A mismatches and the 
thymidine DNA glycosylase from HeLa cells cleaves T at 
G/T mismatches (Hsu et al. (1994) Carcinogenesis 15:1657- 
1662) . According to an exemplary embodiment, a probe 
based on a T139 sequence, e.g.. a wild-type T139 
20 sequence, is hybridized to a cDNA or other DNA product 
from a test cell(s). The duplex is treated with a DNA 
mismatch repair enzyme, and the cleavage products, if 
any, can be detected from electrophoresis protocols or 
the' like. see. e.g., U.S. Patent No. 5,459,039. 
25 In other embodiments, alterations in 

electrophoretic mobility will be used to identify 
mutations in T139 genes. For example, single strand 
conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant 
30 and wild type nucleic acids (Orita et al. (1989) Proc 
Natl. Acad. Sci USA: 86:2766; see also Cotton (1993) 
ffutat. Res. 285:125-144; and Hayashi (1992) Genet Anal 
Tech Appl 9:73-79) . Single-Stranded DNA fragments of 
sample and control T139 nucleic acids will be denatured 
35 and allowed to renature. The secondary structure of 
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single-stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic 
mobility enables the detection of even a single base 
change. The DNA fragments may be labeled or detected 

5 with labeled probes. The sensitivity of the assay may be 
enhanced by using RNA (rather than DNA) , in which the 
secondary structure is more sensitive to a change in 
sequence. In a preferred embodiment, the subject method 
utilizes heteroduplex analysis to separate double 

10 stranded heteroduplex molecules on the basis of changes 
in electrophoretic mobility (Keen et al. (1991) Trends 
Genet 7:5) . 

In yet another embodiment, the movement of mutant 
or wild-type fragments in polyacrylamide gels containing 
15 a gradient of denaturant is assayed using denaturing 

gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Nature 313:495) . When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of 
20 approximately 40 bp of high-melting GC-rich DNA by PGR. 
In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
Reissner (1987) Biophys Chem 265:12753). 
25 Examples of other techniques for detecting point 

mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
oligonucleotide primers may be prepared in which the 
30 known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Nature 324:163); Saiki et al. (1989) Proc. Natl Acad. Sci 
USA 86:6230). Such allele specific oligonucleotides are 
35 hybridized to PGR amplified target DNA or a number of 
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different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 

labeled target DNA. 

Alternatively, allele specific amplification 
5 technology which depends on selective PGR amplification 
may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends on 

10 differential hybridization) (Gibbs et al . (1989) Nucleic 
AcidB Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent, or reduce polymerase extension (Prossner (1993) 
Tibtech 11:238) . In addition, it may be desirable to 

15 introduce a novel restriction site in the region of the 
mutation to create cleavage-based detection (Gasparini et 
al. (1992) Mol, Cell Probes 6:1). It is anticipated that 
in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany 

20 (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at 
the 3' end of the 5' sequence making it possible to 
detect the presence of a known mutation at a specific 
site by looking for the presence or absence of 

25 amplification. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, 

30 e.g., in clinical settings to diagnose patients 

exhibiting symptoms or family history of a disease or 
illness involving a T139 gene. 

Furthermore, any cell type or tissue, preferably 
peripheral blood leukocytes, in which T139 is expressed 
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may be utilized in the prognostic assays described 
herein . 

3 . Pharmacoqenomics 
5 Agents, or modulators which have a stimulatory or 

inhibitory effect on T139 activity (e.g., T139 gene 
expression) as identified by a screening assay described 
herein can be administered to individuals to treat 
(prophylactically or therapeutically) disorders (e.g., an 
10 immunological disorder) associated with aberrant T139 
activity. In conjunction with such treatment, the 
pharmacogenomics (i.e., the study of the relationship 
between an individual's genotype and that individual's 
response to a foreign compound or drug) of the individual 
15 may be considered. Differences in metabolism of 

therapeutics can lead to severe toxicity or therapeutic 
failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, 
the pharmacogenomics of the individual permits the 
20 selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 
appropriate dosages and therapeutic regimens. 
25 Accordingly, the activity of T139 protein, expression of 
T139 nucleic acid, or mutation content of T139 genes in 
an individual can be determined to thereby select 
appropriate agent (s) for therapeutic or prophylactic 
treatment of the individual. 
30 Pharmacogenomics deals with clinically significant 

hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 
persons. See, e.g., Linder (1997) Clin. Chem. 
43(2) -.254-266. In general, two types of pharmacogenetic 
35 conditions can be differentiated. Genetic conditions 



wo 99/54343 



PCT/US99/08896 



86 - 



15 



transmitted as a single factor altering the way drugs act 
on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body 
acts on drugs (altered drug metabolism) . These 
5 pharmacogenetic conditions can occur either as rare 
defects or as polymorphisms. For example, glucose-6- 
phosphate dehydrogenase deficiency (G6PD) is a common 
inherited enzymopathy in which the main clinical 
complication is haemolysis after ingestion of oxidant 
10 drugs (anti-malarials, sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of 
drug metabolizing enzymes is a major determinant of both 
the intensity and duration of drug action. The discovery 
of genetic polymorphisms of drug metabolizing enzymes 
(e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 
enzymls CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug 
effects or show exaggerated drug response and serious 
toxicity after taking the standard and safe dose of a 
drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer 
(EM) and poor metabolizer (PM) . The prevalence of PM is 
different among different populations. For example, the 
25 gene coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6. Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 
exaggerated drug response and side effects when they 
30 receive standard doses. If a metabolite is the active 
therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated 
by its cyP2D6- formed metabolite morphine. The other 
extreme are the so called ultra-rapid metabolizers who do 
35 not respond to standard doses. Recently, the molecular 



20 
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basis of ultra-rapid metabolism has been identified to be 
due to CYP2D6 gene amplification. 

Thus, the activity of T139 protein, expression of 
T139 nucleic acid, or mutation content of T139 genes in 
5 an individual can be determined to thereby select 
appropriate agent (s) for therapeutic or prophylactic 
treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping 
of polymorphic alleles encoding drug-metabolizing enzymes 

10 to the identification of an individual's drug 

responsiveness phenotype. This knowledge, when applied 
to dosing or drug selection, can avoid adverse reactions 
or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a 

15 T139 modulator, such as a modulator identified by one of 
the exemplary screening assays described herein. 

4. Mnnitorina of Effect ^ Purina HI inical Trials 
Monitoring the influence of agents (e.g., drugs, 
compounds) on the expression or activity of T139 (e.g., 
20 the ability to modulate aberrant cell proliferation 

and/or differentiation) can be applied not only in basic 
drug screening, but also in clinical trials. For 
example, the effectiveness of an agent determined by a 
screening assay as described herein to increase T139 gene 
25 expression, protein levels, or upregulate T139 activity, 
can be monitored in clinical trails of subjects 
exhibiting decreased T139 gene expression, protein 
levels, or downregulated T139 activity. Alternatively, 
the effectiveness of an agent determined by a screening 
30 assay to decrease T13 9 gene expression, protein levels, 
or downregulated T139 activity, can be monitored in 
clinical trails of subjects exhibiting increased T139 
gene expression, protein levels, or upregulated T139 
activity. In such clinical trials, the expression or 
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activity of T139 and, preferably, other genes that have 
been implicated in, for example, a cellular proliferation 
disorder can be used as a "read out" or markers of the 
immune responsiveness of a particular cell. 
5 For example, and not by way of limitation, genes, 

including T139, that are modulated in cells by treatment 
with an agent (e.g., compound, drug or small molecule) 
which modulates T139 activity (e.g., identified in a 
screening assay as described herein) can be identified. 
10 Thus, to study the effect of agents on cellular 

proliferation disorders, for exait?)le, in a clinical 
trial, cells can be isolated and RNA prepared and 
analyzed for the levels of expression of T139 and other 
genes implicated in the disorder. The levels of gene 
15 expression (i.e., a gene expression pattern) can be 
quantified by Northern blot analysis or RT-PCR, as 
described herein, or alternatively by measuring the 
amount of protein produced, by one of the methods as 
described herein, or by measuring the levels of activity 
20 of T139 or other genes. In this way, the gene expression 
pattern can serve as a marker, indicative of the 
physiological response of the cells to the agent. 
Accordingly, this response state may be determined 
before, and at various points during, treatment of the 
25 individual with the agent. 

In a preferred embodiment, the present invention 
provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic 
30 acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the 
steps of (i) obtaining a pre-administration sample from a 
siibject prior to administration of the agent; (ii) 
detecting the level of expression of a T139 protein, 
35 mRNA, or genomic DNA in the preadministration sample; 
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(iii) obtaining one or more post -administration samples 
from the subject; (iv) detecting the level of expression 
or activity of the T139 protein, mRNA, or genomic DNA in 
the post -administration samples; (v) comparing the level 
5 of expression or activity of the T139 protein, mRNA, or 
genomic DNA in the pre- administration sample with the 
T139 protein, mRNA, or genomic DNA in the post 
administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. 

10 For example, increased administration of the agent may be 
desirable to increase the expression or activity of T139 
to higher levels than detected, i.e., to increase the 
effectiveness of the agent. Alternatively, decreased 
administration of the agent may be desirable to decrease 

15 expression or activity of T139 to lower levels than 
detected, i.e., to decrease the effectiveness of the 
agent . 

C. Methods of Treatment 
The present invention provides for both 
20 prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or 
having a disorder associated with aberrant T139 
expression or activity. Such disorders include kidney 
defects such as kidney failure or hyperplasia. 

25 1. Prophvlactic Me thods 

In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 
associated with an aberrant T139 expression or activity, 
by administering to the subject an agent which modulates 

30 T139 expression or at least one T139 activity. Subjects 
at risk for a disease which is caused or contributed to 
by aberrant T139 expression or activity can be identified 
by, for example, any or a combination of diagnostic or 
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prognostic assays as described herein. Administration of 
a prophylactic agent can occur prior to the manifestation 
of symptoms characteristic of the T139 aberrancy, such 
that a disease or disorder is prevented or, 
5 alternatively, delayed in its progression. Depending on 
the type of T139 aberrancy, for example, a T139 agonist 
or T139 antagonist agent can be used for treating the 
s\ibject. The appropriate agent can be determined based 
on screening assays described herein. 

10 2. " Therapeu l- i r: Methods 

Another aspect of the invention pertains to 
methods of modulating T139 expression or activity for 
therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that 
15 modulates one or more of the activities of T139 protein 
activity associated with the cell. An agent that 
modulates T139 protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a 
naturally-occurring cognate ligand of a T139 protein, a 
20 peptide, a T139 peptidomimetic, or other small raolecule. 
In one embodiment, the agent stimulates one or more of 
the biological activities of T139 protein. Examples of 
such stimulatory agents include active T139 protein and a 
nucleic acid molecule encoding T139 that has been 
25 introduced into the cell. In another embodiment, the 
agent inhibits one or more of the biological activities 
of T139 protein. Examples of such inhibitory agents 
include antisense T139 nucleic acid molecules and anti- 
T139 antibodies. These modulatory methods can be 
30 performed in vitro (e.g., by culturing the cell with the 
agent) or, alternatively, in vivo (e.g, by administering 
the agent to a subject) . As such, the present invention 
provides methods of treating an individual afflicted with 
a disease or disorder characterized by aberrant 
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expression or activity of a T139 protein or nucleic acid 
molecule. In one embodiment, the method involves 
administering an agent {e.g., an agent identified by a 
screening assay described herein) , or combination of 
5 agents that modulates (e.g., upregulates or 

downregulates) T139 expression or activity. In another 
embodiment, the method involves administering a T139 
protein or nucleic acid molecule as therapy to compensate 
for reduced or aberrant T139 expression or activity. 

10 Stimulation of T139 activity is desirable in 

situations in which T139 is abnormally downregulated 
and/or in which increased T139 activity is likely to have 
a beneficial effect. Conversely, inhibition of T139 
activity is desirable in situations in which T139 is 

15 abnormally upregulated and/or in which decreased T139 
activity is likely to have a beneficial effect. 

This invention is further illustrated by the 
following examples which should not be construed as 
limiting. The contents of all references, patents and 

20 published patent applications cited throughout this 
application are hereby incorporated by reference. 



EXAMPLES 



25 



Example 1: T^nlation a nd Characterization of Human T139 
cDNA 

RNA was isolated from human fetal kidney tissue, 
and the polyA+ fraction was purified using Oligotex beads 
(Qiagen) . Three micrograms of polyA+ RNA were used to 
synthesize a cDNA library using the Superscript cDNA 
Synthesis kit (Gibco BRL; Gaithersburg , MD) . 
30 Complementary DNA was direct ionally cloned into the 

expression plasmid pMET7 using the Sail and NotI sites in 
the polylinker to construct a plasmid library. 
Transformants were picked and grown for single-pass 
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sequencing. One cDNA clone ( jthKall5e09) was identified 
that encoded a protein with homology to testis-specific 
protein-l (TPX-1) , an acrosomal sperm protein that is a 
member of the SCP-like family of cysteine-rich secreted 
5 proteins. JthKall5e09 contains an open reading frame of 
446 amino acids, which is referred to as "Tango 139". 

Example 2: »t-r-ibntion TT^g mRNA in Human Tissues 

The expression of T139 was analyzed using Northern 
blot hybridization. Oligonucleotide primers (5' 
10 CCATGCTGCATCCAGAG 3' (SEQ ID NO: 14); 5' 

CACAGACAAAGGCTTCTATC 3' (SEQ ID NO: 15)) were used to 
amplify a 543 bp fragment from the coding region of 
jthKall4e09. and the DNA was radioactively labeled with 
«P-dCTP using a Prime-It kit (Stratagene, La Jolla, CA) 
15 according to the supplier's instructions. Filters 

containing human mRNA (MTNI and MTNII from Clontech, Palo 
Alto, CA) were probed in ExpressHyb hybridization 
solution (Clontech) and washed at high stringency 
according to manufacturer's recommendations. 
20 Tango 139 is expressed at high levels as a 

transcript of about 2.0 kb in the kidney, with lower 
levels in the testis. In addition, there are additional 
transcripts in both kidney and testis at about 2.4 and 
3.5 kb. No other tissues examined (heart, brain, 
25 placenta, lung, liver, skeletal muscle, pancreas, spleen, 
thymus, ovaries, small intestine, colon and peripheral 
blood leukocytes) showed any expression. 

Example 3: r-h^r-anteriz ^^i on of T139 Proteins 

In this example, the predicted amino acid sequence 
30 of human T139 protein was compared to amino acid 
sequences of known proteins and various motifs were 
identified. In addition, the molecular weight of the 
human T139 proteins was predicted. 
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The human T139 cDNA isolated as described above 
(Figure 1; SEQ ID N0:1) encodes a 446 amino acid protein 
(Figure 1; SEQ ID NO: 2) . The signal peptide prediction 
program SIGNAL? (Nielsen et al. (1997) Protein 
5 Engineering 10:1-6) predicted that T139 includes a 26 
amino acid signal peptide (amino acid 1 to about amino 
acid 26 of SEQ ID N0:2) preceding the 420 amino acid 
mature protein (about amino acid 27 to amino acid 446; 
SEQ ID N0:4) . A hydropathy plot of T139 is presented in 
10 Figure 3. This plot shows the location of cysteines 
("cys"; short vertical lines just below plot) and the 
PFAM identifiers (PF00188, PF00008. and PF00059; bars 
just above plot) . For general information regarding PFAM 
identifiers refer to Sonnhammer et al. (1997) Protein 
15 28:405-420 and http://www.psc.edu/general/software/ 
packages/pf am/pf am.html . 

AS shown in Figure 2A, T139 has a region of 
homology (amino acids 47 to 190. of SEQ ID N0:2; SEQ ID 
NO: 5) to a SCP-like domain consensus sequence (PF00188, 
20 of SEQ ID NO: 2; SEQ ID NO: 9). Figure 2B shows the region 
of homology (amino acids 297 to 412, SEQ ID N0:2; SEQ ID 
NO: 6) to the C-type lectin domain consensus sequence 
(PF00059, SEQ ID N0:10). Although significant homology 
was observed, the four cysteines in this region of T139 
25 do not match the four conserved cysteines in the 

consensus sequence; the alignment only recognized three 
of the four cysteines in this region of T13 9 as identical 
to the consensus cysteines. Figure 2C shows the regions 
of homology (amino acids 232 to 260 of SEQ ID NO: 2; 
30 (EGFl) and 264 to 291 of SEQ ID N0:2; (EGF2) ; SEQ ID 

N0S:7 and 8, respectively) to a EGF-like domain consensus 
sequence (PF00008, SEQ ID N0:11) . Although both the EGFl 
and EGF2 domains contain six cysteines as does the 
consensus, the alignment only recognizes five cysteines 
35 as matching the consensus. Mature T139 has a predicted 
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MW of 49 kDa (47 kDa with the signal peptide removed) , 
not including post-translational modifications. A signal 
peptide is predicted to exist from amino acids 1 to 26, 
using the prediction program SIGNALP (Nielsen et al. 
5 (1997) Protein Engineering 10:1-6). 

AS noted above, T139 is homologous to testis- 
specific protein-1 (TPX-l; Kasahara et al. (1989) 
Genomics 5:527-34) . TPX-l is a testis-specif ic protein 
that is expressed by spermatogenic cells and is thought 
10 to cause spermatogenic cells to bind to somatic Sertoli 
cells. Sertoli cells are essential for the 
proliferation, differentiation, and survival of 
spermatogenic cells. Human TPX-l is a member of the 
CRISP family. Members of this family have a cysteine - 
15 rich C-terminal region having the pattern CXaCXeCXgCXgC 
(Foster and Gerton (1996) Mol. Reprod. Dev. 44:221-29; 
Kratzch et al . (1996) Eur. J. Biochem. 236:827-36). 
Another member of this family is acidic epididymal 
glycoprotein (AEG) . This protein is present on the 
20 surface of the plasma membrane at the sperm head and is 
thought to be involved in sperm-egg fusion (Brooks and 
Tiver (1983) J- Reprod. Fertil. 69:650-57). 

Based on the homology between T139 and TPX-l and 
the expression of T139 in testis, T139 may play a role in 
25 spermatogenesis and other testicular functions. 

Accordingly, T139 nucleic acids, polypeptides, and 
modulators of T139 expression or activity can be used to 
modulate spermatogenesis or sperm-egg fusion. In 
addition, such molecules may be useful for treatment of a 
variety of testicular disorders, e.g., testicular cancer. 



30 



Example 4: Pr-f>naration of T139 P roteins 

Recombinant T129 can be produced in a variety of 
expression systems. For example, the mature T129 peptide 
can be expressed as a recombinant glutathione- S- 
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transferase (GST) fusion protein in E. coli and the 
fusion protein can be isolated and characterized. 
Specifically, as described above, T129 can be fused to 
GST and this fusion protein can be expressed in E. coli 
strain PEB199. Expression of the GST-T129 fusion protein 
in PEB199 can be induced with IPTG. The recombinant 
fusion protein can be purified from crude bacterial 
lysates of the induced PEB199 strain by affinity 
chromatography on glutathione beads. 



10 E quivalents 

Those skilled in the art will recognize, or 
able to ascertain using no more than routine 
experimentation, many equivalents to the specific 
embodiments of the invention described herein. Sue 

15 equivalents are intended to be encompassed by the 
following claims. 
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What is claimed is: 

1. An isolated nucleic acid molecule selected 
from the group consisting of: 

a) a nucleic acid molecule comprising a 

5 nucleotide sequence which is at least 55% identical to 
the nucleotide sequence of SEQ ID N0:1 or SEQ ID NO: 3, 
the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98694, or a complement thereof; 

b) a nucleic acid molecule coirprising a fragment 
10 of at least 250 nucleotides of the nucleotide sequence of 

SEQ ID N0:1 or SEQ ID NO: 3, the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98694, or 
a complement thereof; 

c) nucleic acid molecule which encodes a 

15 polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98694; 

d) a nucleic acid molecule which encodes a 
20 fragment of a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 2 or SEQ ID N0:4, wherein the 
fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO: 2 or SEQ ID NO: 4 or the polypeptide encoded by 
the cDNA insert of the plasmid deposited with ATCC as 
25 Accession Number 98694; and 

e) a nucleic acid molecule which encodes a 
naturally occurring allelic variant of a polypeptide 
comprising the amino acid sequence of SEQ ID N0:2 or SEQ 
ID NO: 4 or an amino acid sequence encoded by the cDNA 

30 insert of the plasmid deposited with ATCC as Accession 
Number 98694, wherein the nucleic acid molecule 
hybridizes to a nucleic acid molecule comprising SEQ ID 
N0:1 or SEQ ID NO: 3 under stringent conditions. 
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2. The isolated nucleic acid molecule of claim 
1, which is selected from the group consisting of: 

a) a nucleic acid conprising the nucleotide 
sequence of SEQ ID N0:1 or SEQ ID NO: 3, or the cDNA 

5 insert of the plasmid deposited with ATCC as Accession 
Number 98694, or a complement thereof; and 

b) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 

10 the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98694. 

3. The nucleic acid molecule of claim 1 further 
comprising vector nucleic acid sequences. 

4. The nucleic acid molecule of claim 1 further 
15 comprising nucleic acid sequences encoding a heterologous 

polypeptide. 

5. A host cell which contains the nucleic acid 
molecule of claim 1. 

6. The host cell of claim 4 which is a mammalian 
20 host cell. 

7. A non-human mammalian host cell containing 
the nucleic acid molecule of claim 1. 

8. An isolated polypeptide selected from the 

group consisting of : 
25 a) a fragment of a polypeptide comprising the 

amino acid sequence of SEQ ID NO: 2 or SEQ ID N0:4, 
wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO: 2 or SEQ ID NO: 4; 
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b) a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 

5 Accession Number 98694, wherein the polypeptide is 

eiicoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule comprising SEQ ID N0:1 or SEQ ID 
NO: 3 under stringent conditions; 

c) a polypeptide which is encoded by a nucleic 
10 acid molecule comprising a nucleotide sequence which is 

at least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID N0:1 or SEQ ID NO: 3. 

9. The isolated polypeptide of claim 8 
comprising the amino acid sequence of SEQ ID NO: 2 or SEQ 

15 ID NO: 4 or an amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 
Number 98694. 

10. The polypeptide of claim 8 further comprising 
heterologous amino acid sequences. 

20 11. An antibody which selectively binds to a 

polypeptide of claim 8. 

12 . A method for producing a polypeptide selected 

from the group consisting of: 

a) a polypeptide comprising the amino acid 

25 sequence of SEQ ID NO: 2 or SEQ ID NO: 4 or an amino acid 
sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98694; 

b) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 4 or an 

30 amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98694, 
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Wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO: 2 or SEQ ID NO: 4 or an amino 
acid sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98694; and 

5 c) a naturally occurring allelic variant of a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98694, wherein the polypeptide is 
10 encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule comprising SEQ ID N0:1 or SEQ ID 
NO: 3 under stringent conditions; 

comprising culturing the host cell of claim 5 
under conditions in which the nucleic acid molecule is 

15 expressed. 

13. The isolated polypeptide of claim 8 
comprising the amino acid sequence of SEQ ID NO: 2 or SEQ 
ID NO: 4 or an amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

20 Number 98694. 

14. A method for detecting the presence of a 
polypeptide of claim 8 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 8; and 
25 b) determining whether the compound binds to the 

polypeptide in the sample. 

15. The method of claim 14, wherein the compound 
which binds to the polypeptide is an antibody. 

16. A kit comprising a compound which selectively 
30 binds to a polypeptide of claim 8 and instructions for 

use . 
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17. A method for detecting the presence of a 
nucleic acid molecule of claim 1 in a sample, comprising 

the steps of: 

a) contacting the sample with a nucleic acid 
5 probe or primer which selectively hybridizes to the 

nucleic acid molecule; and 

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 

18. The method of claim 11, wherein the sample 
10 comprises mRNA molecules and is contacted with a nucleic 

acid probe. 

19. A kit comprising a compound which selectively 
hybridizes to a nucleic acid molecule of claim 1 and 
instructions for use. 

20. A method for identifying a compound which 
binds to a polypeptide of claim 8 comprising the steps 
of: 

a) contacting a polypeptide, or a cell 
expressing a polypeptide of claim 8 with a test compound; 

20 and 

b) determining whether the polypeptide binds to 
the test compound. 

21. The method of claim 20, wherein the binding 
of the test compound to the polypeptide is detected by a 

25 method selected from the group consisting of: 

a) detection of binding by direct detecting of 
test compound/polypeptide binding; 

b) detection of binding using a competition 
binding assay; 

30 c) detection of binding using an assay for EGF 

or calcium-dependent carbohydrate recognition activity. 
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22. A method for modulating the activity of a 
polypeptide of claim 8 comprising contacting a 
polypeptide or a cell expressing a polypeptide of claxm 8 
with a compound which binds to the polypeptide in a 
sufficient concentration to modulate the activity of the 
polypeptide . 

23 A method for identifying a compound which 
modulates the activity of a polypeptide of claim 8. 

comprising: .^u = 

a) contacting a polypeptide of claim 8 with a 

test compotind; and 

b) determining the effect of the test compound 
on the activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 



wo W/54343 



1 / 6 



PCT/US99/08896 



SacCCAACAGACCC ATG CTG CAT CCA GAG ACC T.- .CT ^ 

20 
40 

^ 60 
80 
100 

TTT GCA GAG GGG CAG CGG TAG AGO CAC 

s s! s ?n s; ss ?5 i s ?s s «v se. 

120 

^ ^^r. ^.nr Trr ACC CAC TAG ACG CAG CTC GTG TGG GCC 

s s s? s s ^ s| s s ^« V.X ^. 

140 

r^r^r. n^n rrr TGC TCT GCA GGC CAG GCA GCG ATA GAA 

S S ^ - S ?S S i - - - - o., ax. ^ n. ..u 

160 -"-^^ 
180 

CCC T« «a «^ |=T JOC TCT ™ CTC TC^ gC 52 ?S J2 ^ 

Pro Tyr Lys Lys Gly Ala Trp Cys ser i.eu . 215 
200 •^"^ 

220 225 
240 

— ™ SI! S S S S §fn Si 52 IS S S |S SS 

Gly Arg Tyr Cys Gin vax Arg ^y^. ^ 275 
260 

280 285 

300 -^"^ 
320 



202 



262 



322 



382 



442 



502 



562 



622 



682 



742 



802 



862 



922 



982 



1042 



1102 



FIG. 1 
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- ^^r. ^nr. RTr r^r GCC TTC TAT CTG GGC CGC CTG GAG ACC ACC 
j^rC AAG AGC CAG AAA GTG CAG GAC ATC CTC GCC TTC TA^ 

lie Lys Ser Gin Lys Vai Gin asp ^50 355 

340 

ni^n nrr AGG AAC TTC TGG ATC GGG CTC ACC TAG AAG 

5f» S S S=p ^ - i| S S ^-^-51 «^ - ^ S-vs 

360 

^^r. nnn ^nr GCC ACA GGG GAG CAC CAG GCC TTC ACC AGT TTT GCC 

ACC GCC AAG GAC TCC TTC CGC TGG GCC ACA 
Thr Ala Lys Asp Ser Phe Arg Trp Ala xnr ^xy 
380 

^ ^Tv*- TTT GGC AAC TGC GTG GAG CTG CAG GCT TCA GCT GCC 

TTT GGG CAG CCT GAC ^AC CAC GGG TTT GGC AAC TGC 

Phe Gly Gin Pro Asp Asn Hxs Gly Phe Giy ash 

400 



GAG CAC ATC TCC CGG TGG GGC CCA GGG TCC TGA 
lie ser Arg Trp Gly Pro Gly Ser * 
440 



1162 



1222 



1282 



1342 



1402 



1435 




FIG. 1 (cont.) 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION 
(i) APPLICANT: Holtzman, Douglas 

Ki) TITLE OF THE INVENTION: NOVEL MOLECULES OF THE T139-RELATED 
(il) TITLE O^^^^jj p^3.LY AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

^ Ta) ADDRESSEE: Fish & Richardson P . C . 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Coitipatible 
tr\ OPERATING SYSTEM: Windows95 

S) s5??S: FastSEQ for Windows Versxon 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 23-APR-1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

fix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/542-5070 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 200154 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I^GTH: 1856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 95... 1432 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

..a^ccc^ SSS ^^crf^^^^c 

TGTGCACGGA GTGCCTGACG GGCCCAACAt. AV.v.i, 

1 5 
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10 '^^ 

^r. ^nn rPA CCC CAG CTG CAG GAG CAG GCT CCG ATG 

S its fa? S "^^ 

25 

7^nr- rAP AOT TTC TTG CTC CTC TCC CTG CAC 

S SS 5S - ™ - - 'S 

40 

Tnr rrr CAG CCC CCT GCG GCT GAC ATG CGG AGG 

S SS S S S 5S «o H.t «g 

60 

A^^n rrr GCC CAA CTG GCT CAA GCC AGG GCA GCC 
S S - S S »5| 

75 

or* arr CCG AGO CTG GCG TCC GGC CTG TGG CGC ACC 

S= S ^" S 1 ^" 

90 

S5 s s s as S SS S IS s ss 

105 

r-TA TW TTT GCA GAG GGG CAG CGG TAC AGC 
SI S as ?S ?S Si S ?S gu 01, CI. xy. S,. 

X20 

155 

^ ^ »a 0« go AT. GJC -2 =TJ T3T OCC «C TCC 

Cys Ser Ala Gly Gin Ala Ala lie oiu ax 
170 ^'^ 

CCC GO. CCC «c ™= go ore «c «5 ?S S ??S S ^ ^ 

Pro Gly Gly Asn Trp Glu Val Asn Qiy i-y 
185 

200 205 

„„r, r.rv r^r TGT GAG GTC CCC AGG AAT CCT 

- S S SI S S I?? g SS V.1 P.O P.O 

^ 220 ^"'^ 



163 
211 
259 
307 
355 
\ 403 
451 
499 
547 
595 
643 
691 



739 



787 



835 



883 
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r^^n nrr CGG TTC CC3G GAG GAG GAG TGC TCG 
TGC A6C CTG CAG TGT GTG gC GGC CGG TTC COG 
Cys Ser Leu Gin Cys Val His Giy Arg s 

265 

- - - r S ?S - SI S S 55? g 

Cys Val Cys Asp He Giy ly^ ^ ^90 295 

280 

300 

P^. GAc ACC TAT TAC AGA GCC AGG ATG AAA TGT 

315 ^"^^ 

330 

345 

r^r. r^i^n am 7VC3G AAC TTC TGG ATC GGG CTC ACC TAC 
ATT GAC AGT GAC TTC GAG ACC AGG AAC TT^ 

He Asp Ser Asp Pne fciAu iiu. ^^^j 375 

360 

»CC OCC TCC ™ TOO OCC ^ ^ g= «C ^ 

Lys Thr Ala Lys Asp Ser Phe Arg rrp 

^ 380 "^'^^ 

rvv^ ri^r rCT GAC AAC CAC GGG TTT GGC AAC 

™ S ?| S SI S? |I 2^ - -V oiv 
™ r ^ SI IS SI ?f. i^. S 5S S 

cys Val Glu Leu Gin Ala S.er aj.^ «^ 
410 

^ wi.r' aTf TfiC CAG TTT GCC CAG GAG CAC ATC 
TGC AAA ACC CGA AAC CGT TAC ATC TGC CAG TTT ^.^ 
Cys Lys Thr Arg Asn Arg Tyr He cys o 



425 "0 



XCC CGG TGG GGC CCA GGG TCC TGAGGCCTGA CCACATGGCT CCCTCGCCTG CCCT 
ser S Xrp Giy Pro Giy Ser 
440 

COGAGCACCG GCTCTGCTTA CCT^CCGCC CACCTGTCTG 

??^TGCCT CATGTCCAAA GMGTCT^ otSSStA GAAGAAGCTG GGGCCCTTCG 
AGAGGCAGGG AGGC^GTGA ^GGCCAGGGA AAGGAGAGGA CACCGCCAGT 

cctgcttttg attgggaaga tgggcttcaa gtggggcagc ggagcttccc 

J^ISaaaaa aaaaaaaaaa gggcggccgc 



931 
979 
1027 
1075 
1123 
1171 
1219 
1267 
1315 
1363 
1411 
1466 



1526 
1586 
1646 
1706 
1766 
1826 
1856 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ill MOLECULE TYPE, protein 
(V) FEMMEBT TYPBi intenul 
ixi) SEQUENCE DBSCRimOH. SEQ ID 110:2. 
L=„ His pro «lu Thr S« P.O Gly Arg <=ly Hi, L.. Le» Ma V,X 
Lei Leu Me Leu Let^oiy Thr Tr| Ma Glu val Trp Pro Pro Gin 
Leu Oln =1» Gin "a Pro Met M. Gly Ma Leu «n »r| Lye Glu s.r 
P.e Leu £ Leu Her Leu His ^n Leu Eer Trp V.l Gin Pro 
Pro M° Ma «P Met «9 Leu Asp Trp Ser «p Her Leu Ma Gin 
Ma Gin Ma «5 mII Ma Leu Cys Gly He Pro Thr Pro Ser Leu 
^. ser Gly Leu W »r9 Thr Leu Gin Val Gly Trp »sn Met Gin Leu 
L.U pro Ma lly Leu Ma Ser P|. Val Glu Val val s.r Leu Trp Phe 
„e Glu Gl. ^ Tyr ser Hi. Ma Ma Gly Glu cy. M. »r, 
M. S cys Thr His Tyr iSr Gin Leu val Trp Ma Thr Ser Ser Gin 
Gly cy, Gly »rg S2 Leu Cy, ser M, Gly Gin M. Ma lie Glu 
M. Ph. val cy. S Tyr Ser Pro Gly Gly ».n Trp Glu V|l »sn Gly 
Ly. ^ He "° P» Tyr Ly. Ly. Gly Ma Trp Cy. Ser 1.U cy. Thr 
^e ser HI ser Gly Cy. Phe Ly. Ma Trp «P Bis Ma Gly Gly Leu 
Cs val pro ^ ^ P~ cy. Met ser cy. Gl. «n His Gly 
'^l Leu «n Ue Ser Cys His Cy. Hi, Cy. Pro Pro Gly Tyr Thr 
Gly Tyr cy. Iln val Ar, cy. ser Gl. cys v.l Hi, Gly Ar. 
P.. ^ Glu Glu cy, ser Cy. val C^ AaP Ha -^V ^ 

Gin S5 Ma Thr Ly, Val Hi. Phe Pro Phe Hi, Thr Cy, A.p Leu 
ill ASP Gly ASP cy, pL Met val S.r ser Glu Ma An, Thr Tyr 
^ M. Ar, Met cy. Gin Ar, Ly. Gly Gly val Leu Ma Gin 
XI. Ly. ser Gin Ifs val Gin Asp .J- Le. Ma Phe Tyr Leu Gly Are 

^1 , ^7^^ Tie AsD ser Asp Phe Glu Thr Arg Asn 
Leu Glu Thr Thr Asn Glu Val lie Asp ber p 

Phe Trp III Oly I^u Thr Tyr Lys Thr Ala Lys Asp Ser Phe Arg Trp 
Ala Gly Glu His Gin Ala Phe Thr Ser Phe Ala Phe Gly Oln Pro 
Asn His Gly Phe ^l^ Asn Cys Val Glu Leu .In Ala Ser Ala Ala 
Phe Asn Trp ;.n t^sl Gin Arg C^s Jys Thr Arg Asn Arg Tyr Xle Cys 
Ola Phe Ala HI Glu His He Ser Arg Trp Gly Pro Gly Ser 
435 
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(2) INFORMATION FOR SEQ ID NO: 3: 

ii) SEQUENCE CHARACTERISTICS: 
^ (A) KtH: 1338 base paxrs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 3 



ATGCTGCATC 
CTTGGCACCG 
GGAGCCCTGA 
TGGGTCCAGC 
CTGGCTCAAG 
TGGCGCACCC 
GTTGAAGTGG 
TGTGCTCGCA 
CTGGGCTGTG 
GCCTACTCCC 
GGTGCCTGGT 
GCAGGGGGGC 
CGTCTCAACA 
CAAGTGAGGT 
GTCTGTGACA 
ACCTGTGACC 
TACAGAGCCA 
AAAGTGCAGG 
GACAGTGACT 
TCCTTCCGCT 
GACAACCACG 
AACCAGCGCT 
CGGTGGGGCC 



CAGAGACCTC 
CCTGGGCAGA 
ACAGGAAGGA 
CCCCTGCGGC 
CCAGGGCAGC 
TGCAAGTGGG 
TCAGCCTATG 
ACGCCACCTG 
GGCGGCACCT 
CCG6AGGCAA 
GTTCGCTCTG 
TCTGTGAGGT 
TCAGCACCTG 
GCAGCCTGCA 
TCGGCTACGG 
TGAGGATCGA 
GGATGAAATG 
ACATCCTCGC 
TCGAGACCAG 
GGGCCACAGG 
GGTTTGGCAA 
GCAAAACCCG 
CAGGGTCC 



CCCTGGCCGG 
GGTGTGGCCA 
GAGTTTCTTG 
TGACATGCGG 
CCTCTGTGGA 
CTGGAACATG 
GTTTGCAGAG 
CACCCACTAC 
GTGCTCT6CA 
CTGGGAGGTC 
CACAGCCAGT 
CCCCAGGAAT 
CCACTGCCAC 
GTGTGTGCAC 
GGGAGCCCAG 
CGGAGACTGC 
TCAGAGGAAA 
CTTCTATCTG 
GAACTTCTGG 
GGAGCACCAG 
CTGCGTGGAG 
AAACCGTTAC 



GGGCATCTCC 
CCCCAGCTGC 
CTCCTCTCCC 
AGGCTGGACT 
ATCCCAACCC 
CAGCTGCTGC 
GGGCAGCGGT 
ACGCAGCTCG 
GGCCAGGCAG 
AACGGGAAGA 
GTCTCAGGCT 
CCTTGTCGCA 
TGTCCCCCTG 
GGCCGGTTCC 
TGTGCCACCA 
TTCATGGTGT 
GGCGGGGTGC 
GGCCGCCTGG 
ATCGGGCTCA 
GCCTTCACCA 
CTGCAGGCTT 
ATCTGCCAGT 



TGGCTGTGCT 
AGGAGCAGGC 
TGCACAACCG 
GGAGTGACAG 
CGAGCCTGGC 
CCGCGGGCTT 
ACAGCCACGC 
TGTGGGCCAC 
CGATAGAAGC 
CAATCATCCC 
GCTTCAAAGC 
TGAGCTGCCA 
GCTACACGGG 
GGGAGGAGGA 
AGGTGCATTT 
CTTCAGAGGC 
TGGCCCAGAT 
AGACCACCAA 
CCTACAAGAC 
GTrTTGCCTT 
CAGCTGCCTT 
TTGCCCAGGA 



CCTGGCCCTC 
TCCGATGGCC 
CCTGCGCAGC 
CCTGGCCCAA 
GTCCGG CCTG 
GGCGTCCTTT 
GGCAGGAGAG 
CTCAAGCCAG 
CTTTGTCTGT 
CTATAAGAAG 
CTGGGACCAT 
GAACCATGGA 
CAGATACTGC 
GTGCTCGTGC 
TCCCTTCCAC 
AGACACCTAT 
CAAGAGCCAG 
CGAGGTGATT 
CGCCAAGGAC 
TGGGCAGCCT 
CAACTGGAAC 
GCACATCTCC 



(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 



Glu val Trp Pro Pro Gin Leu Gin Glu Gin Ala Pro 
.ei Asn Arg Lys Glu Ser Phe Leu Leu Leu Ser Leu 
ATS ser Trp Va! Gin Pro Pro Ala Ala Asp Met Arg 
ser ASP Ser Leu Ala Gin Leu Ala Gin Ala Arg Ala 
lie PrS Thr Pro Ser Leu Ala Ser Gly Leu Trp Arg 
o'ly Trp Asn Met Gin Z Leu Pro Ala Gly Leu Ala 
val val ser Leu T^ Phe Ala Glu Gly Gin Arg Tyr 
Gly Glu Cys Ma Arg Asn Ala Thr Cys Thr His Tyr 
115 -^^^ 



Met Ala 



His Asn 
30 

Arg Leu 

45 
Ala Leu 

Thr Leu 

Ser Phe 



Gly Ala 

15 
Arg Leu 

Asp Trp 

Cys Gly 

Gin Val 
80 

Val Glu 
95 

Ala Ala 



Ser His 
110 

Thr Gin Leu Val 
125 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1338 
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Trp Ala Thr Ser Ser Gin Leu Gly Cys Gly Arg His Leu Cys Ser Ala 
Gly III Ala Ala He Glu Ala Phe Val Cys Ala Tyr Ser Pro Gly Gly 
111 Xrp Glu val Asn g!J Lys Thr He lie Pro Tyr Lys Lys Gly Ala 
Trp cys ser Leu ^s Thr Ala Ser Val Ser Gly. Cys Phe Lys Ala Trp 
.sp His Ala "ly Gly Leu cys Glu Val Pro Arg Asn Pro Cys Arg Met 
ser cys G^n Asn His Gly Arg Leu Asn lie Ser Thr Cys His Cys His 
cys P" pro Gly Tyr Thr Gly Arg Tyr Cys Gin Val Arg Cys Ser Leu 
cys val His Gly ^ Phe Arg Glu Glu Glu Cys Ser Cys Val Cys 
;^p lie Gly Tyr gS Gly Ala Gin Cys Ala Thr Lys Val His Phe Pre 
Phe His Thr ASP Leu Arg lie Asp Gly Asp Cys Phe Met Val Ser 
ser Glu Sa Asp Thr Tyr Tyr Arg Ala Arg Met Lys Cys Gin Arg Lys 
Oly III val Leu Ala Gin He Lys Ser Gin Lys Val Gin Asp He Leu 
m Phe Tyr Leu Gly ^g Leu Glu Thr Thr Asn Glu Val lie Asp Ser 
ASP Phe Glu Thr S Asn Phe Trp lie Gly Leu Thr Tyr Lys Thr Ala 
.ys ASP ser pie Arg Trp Ala Thr Gly Glu His Gin Ala Phe Thr Ser 
Phe Ala Phe Gly Gin Pro Asp Asn His Gly Phe Gly Asn Cys Val Glu 
.eu III Ala ser Ala Ala Phe Asn Trp Asn Asn Gin Arg Cys Lys Thr 
S Asn Arg Tyr He gs Gin Phe Ala Gin Glu His He Ser Arg Trp 
405 

Gly Pro Gly Ser 
420 



